PLANT LIKE STARCHES AND THE METHOD OF MAKING THEM IN HOSTS

The present application is based on U.S. Provisional Application No. 60/062,939, filed
April 4, 1997, the entire contents of which are hereby incorporated by reference.

BACKGROUND - FIELD OF INVENTION 

This invention relates to hosts containing constructs with genes from the starch 
pathway. More specifically the present invention relates to bacterial hosts that form plant like 
starches. Additionally the present invention relates to plant hosts that have genes from the 
starch pathway. The invention further relates to the starches produced by said hosts. 

BACKGROUND - DESCRIPTION OF PRIOR ART 

The starch-using industry is diverse and includes candy makers, makers of
adhesives and paints, gravy makers, paper producers, etc. Since the demand for starch (which
is formed of amylose and amylopectin) has been increasing dramatically for specialized food
and industrial uses, efforts have been undertaken to tailor the quantity and quality of starch for
specific food and industrial uses.

Over time this industry has looked for a number of different starches having high
viscosity, lower viscosity, higher gelling strength, lower gelling strength, different boiling
points, etc., each starch being tailored for a number of uses. The industry has utilized mutant
starches that have less amylopectin and mutant starches with more amylose for tailored
specifications. For example, the increased amylose starch has been used in the gelled candy
making area. The industry has also used the increased amylopectin starches formed by mutants
such as wx and wx su2, which contain little amylose and mostly amylopectin, for thickening
foods such as puddings, pies, gravies, frozen foods, batters, stews, canned foods and baby food.
Additionally, mutant starches of different types are useful as adhesives and as sizing.

The other method used to address the industry's need for tailored starch is chemical
modification of the starch. Chemical derivatives of the starch are produced by chemically
reacting the starch with monofunctional reagents to introduce substituents such as
phosphate, acetate and succinate groups to stabilize the starch. Unfortunately, these types
of starches can be subject to government regulation and additionally have less acceptance
generally due to the added cost of the starch.

Starch is the major form in which carbohydrates are stored in biological systems. Plant
starch in chloroplasts is transitory, and storage starch accumulates in the storage organs of
many plants. Starch can be found in all organs of most higher plants, including leaves, stems,
roots, fruits, embryo and endosperm. In addition to the starch of higher plants, a similar
polysaccharide (glycogen) has been found in bacteria. Many bacteria produce a reserve
polysaccharide similar to the glycogen found in animals.

Storage polysaccharides have been classified into two groups: the first group is
stored in the cytosol of the cell and the second group within the plastid. Escherichia coli
produces a polysaccharide within the cytosol. Starch producing plants typically store starch in
the plastids. Typical starch bearing plants include cassava, potato, corn, peas, rice, wheat and
barley. The main starch storing tissue of the cereal grains (corn, rice, wheat, barley and oats)
is the endosperm.

Starches are also classified by the plant source; for example, cereal starches are from
cereal grains such as maize, rice, wheat, barley, oats and sorghum, while tuber and root
starches are from potatoes, yams and cassava.

The pathway of starch synthesis is not well understood. Generally, as noted above,
starch from plants consists of two major components: amylose and amylopectin. These
intertwine in the starch granule of the plants. Amylose is a linear polymer of alpha 1-4 bonded
anhydroglucose units, while amylopectin is a branched polymer comprised of linear chains of
alpha 1-4 linked anhydroglucose units with branches resulting from alpha 1-6 linkages between
the linear chains. It has been known for some time that mutant genes in starch bearing plants
can be expressed and that the properties of the starch can be altered. The proportion of the two
components and their structures in the mutant primarily determine the physical-chemical
properties of the starch.

Thus the lack of a clear understanding of the starch synthesis pathway and the difficulty
of employing mutants have limited the industry to the use of existing and producible mutant
starches (cereals containing mutant starch can show a tremendous yield penalty in field
environments) or to the chemical modifications that can be made to the starch. In the last
decade the industry has been studying the effects of certain starch genes in plants and bacteria
in an attempt to understand starch synthesis more clearly. Since the late 1980s it has been
possible to transform plants and bacteria to contain isolated genes. In response, the industry
has transformed potatoes with a bacterial GS gene and with the starch soluble synthase III gene
in the antisense orientation (forming a mutant). As part of these potato starch experiments,
bacteria have been transformed with certain potato starch genes. For example, the SSSIII gene
from potato was transformed into E. coli deficient in the glgA gene. The effect of glgC and
branching enzymes I and II in combination in a mutant E. coli has also been studied, and a
glycogen like product was reported. The commercial starch industry does not have a particular
interest in the production of glycogen, which is the polysaccharide produced by bacteria and
animals (the health care industry may have some such interest). The industry has thus not yet
been able to generate tailored starches at reasonable prices through plant gene transformation.
There remains a need for the industry to find new starches that are useful due to their changed
characteristics, such as lower viscosity, new starches that are useful because of their higher
viscosity, and new methods of producing such starches.

Glycogen synthesis in E. coli and starch synthesis in higher plants have similar
pathways involving ADPGlc pyrophosphorylase, starch synthase (SS) or glycogen synthase
(GS), and branching enzyme (BE). It has been suggested that ADPGlc pyrophosphorylase
plays a pivotal role in regulating the amount of starch synthesized, while starch synthase and
starch branching enzyme (SBE) primarily determine the starch structure. Multiple forms of
SBE and SS have been identified in many plants including maize, rice, pea and potato. In
addition to the waxy gene coding for granule bound starch synthase (GBSS), three genes coding
for the other forms of SS have been isolated from maize endosperm. Maize is the only cereal
crop from which the genes coding for the five forms of SS have been isolated. Clearly a number
of these sequences are published and known to those of skill in the art. Genes coding for maize
SBE have also been cloned and characterized. Previous reports have demonstrated that maize
SBEI has a higher rate of branching amylose than SBEII and preferentially transfers longer
chains, while SBEII shows a higher rate of branching amylopectin and preferentially transfers
shorter chains. In comparison with SBE, less is known about the specificities and functions of
the multiple forms of SS. In waxy maize, which lacks GBSS, only amylopectin is synthesized
and amylose is missing. Therefore, it is generally accepted that GBSS, encoded by the waxy
gene, is primarily responsible for the synthesis of amylose. Study of a waxy mutation in
Chlamydomonas reinhardtii has suggested that GBSS is also involved in amylopectin synthesis.
Although it has been reported that SSII controls the synthesis of intermediate size glucans of
amylopectin in Chlamydomonas reinhardtii, direct evidence for the functions of SS in higher
plants is still missing. Antisense technology has been used to study the functions of SS in
potato; however, the results are inconclusive.



In an article by Hanping Guan et al., entitled "Maize Branching Enzyme
Catalyzes Synthesis of Glycogen-like Polysaccharide in glgB-deficient Escherichia coli",
published in Proc. Natl. Acad. Sci. USA, Vol. 92, pp. 964-967, February 1995, Plant Biology, a
specific glycogen like polysaccharide from a transformed E. coli was reported. This article
taught the transformation of E. coli with maize BEI and BEII. These enzymes were
transformed into two E. coli hosts. One of the bacterial hosts was a wild type and the other was
a mutant. The mutation in the bacteria was the reduction of the activity of glycogen BE in
AC71 (glgB-), so that the mutant was essentially free of BE activity. The paper analyzed the
debranched alpha-glucan isolated from the four different transformants. The first host was E.
coli containing glgB and the second host was AC71 without any transformed genes, followed by
AC71 transformed with maize BEII, AC71 with maize BEI, and AC71 with maize
BEI and BEII. The resultant polysaccharide products were analyzed by HPLC, by chain length
and relative peak area, and by mole distribution of chains. The study led to the understanding
that BEII could play a role in synthesis of the short chains of amylopectin and BEI could be
involved with the longer chains of amylopectin. The paper also noted that the mutant host
AC71 produced more chains with a chain length of 6 than did the wild type E. coli. The paper
also noted that the maize BE and the GS of the host did not produce amylopectin like
polysaccharides. The article suggested that the concerted action of GS with different BEs
could play an important role in determining the final structure of the polysaccharide
synthesized. The article ends by suggesting that the study had established the basis for
studying the concerted actions of BE and SS in a bacterial model system.
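
As an illustration of how a relative peak area profile relates to a mole distribution of chains, the following Python sketch converts one into the other. It assumes that the detector response is proportional to the mass of glucan in each peak, so that the molar amount of a chain is proportional to its peak area divided by its chain length. This assumption, the function name and the numerical values are hypothetical and are not taken from the Guan et al. article.

```python
def mole_distribution(peak_area_pct):
    """Convert relative peak area (%) per chain length into mole percent.

    peak_area_pct maps chain length (glucose units) to relative peak area.
    Assumption (not from the source): peak area is proportional to mass,
    so moles of a chain are proportional to area / chain length.
    """
    molar_weights = {dp: area / dp for dp, area in peak_area_pct.items()}
    total = sum(molar_weights.values())
    return {dp: 100.0 * w / total for dp, w in molar_weights.items()}


# Hypothetical profile, for illustration only (not data from the article).
example_profile = {6: 10.0, 12: 40.0, 24: 50.0}
example_moles = mole_distribution(example_profile)

for dp in sorted(example_moles):
    print(f"chain length {dp}: {example_moles[dp]:.1f} mol %")
```

Under this assumption short chains carry more weight in the mole distribution than in the peak area profile, which is why the two representations of the same chromatogram can look quite different.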

The expression of E. coli GS (glycogen synthase) in potatoes showed a large incidence
of highly branched starch. This result was published in an article in Plant Physiol. 104,
1159-1166, by Shewmaker et al. This potato does not appear to be of much commercial use at
this time.

The industry still needs the option of producing plant like starches in a fermentation
process from bacteria, and thus without the necessity of breeding and growing environment
sensitive plants, and the option of producing plants that generate the specific tailored starch
through a plant host. The industry also needs altered and new starches that are cereal like
starches or root and tuber like starches, in large quantities and inexpensively, thus avoiding
the need for chemical modification of starch. The industry needs a host that can be readily
transformed to supply different starches tailored to the industry's needs. Specifically, the
industry needs a host that supplies various different starches, including those not presently
capable of being made in plants or bacteria.

OBJECTS AND ADVANTAGES 

Accordingly, several objects and advantages of the invention are to produce plant like 
starch through the process of fermentation. 

Additional objects and advantages are the production of new starches in plants. 

Still further objects and advantages will become apparent from a consideration of the 
ensuing description and accompanying drawings. 

Another object of the present invention is the synthesis of polysaccharides, including
amylose and amylopectin, in E. coli and/or fungi and yeast by plant starch synthesizing enzymes
including SS, SBE, bacterial branching enzyme, glycogen synthase and other enzymes in other
living organisms.

Yet another object of my invention is the use of each of, or a combination of, these enzymes
or modified enzymes studied in this patent to produce or to improve polysaccharides in any
living organism, including starch synthesis in plants.

SUMMARY OF THE INVENTION 

The invention provides DNA constructs in a host that include most of the genes in the
starch pathway of a plant, such that the host produces a plant like polysaccharide. One
embodiment produces maize starch, including slightly different embodiments that make
specific maize mutant like starch in a non plant host. This invention encompasses a bacterial
host containing a combination of two or more of the genes SSI, SSIIa, SSIIb, SSIII, GBSS,
BEI and BEII when the combination does not form glycogen like material. This invention
encompasses a plant host transformed with any of the following maize genes, or a plant host
having a combination of two or more of the following maize genes, SSI, SSIIa, SSIIb, SSIII,
GBSS, BEI and BEII, in a hybrid or an inbred rice plant.

Additionally the present invention includes a new polysaccharide produced by a
transformed host, the wild type of which does not produce the new
polysaccharide. The transformed host expresses at least two exogenous starch synthesis genes
selected from a group consisting of starch synthesis genes such as SSI, SSIIa,
SSIIb, SSIII and GBSS, optionally including at least one of the BEI and BEII genes, wherein
the transformed host is capable of producing such new polysaccharide.

The invention also covers a new polysaccharide wherein the host also expresses
exogenous bacterial glycogen inducing genes selected from the group consisting of glgA,
glgB, glgC and any mutants thereof, or wherein the host also expresses exogenous genes
selected from the group consisting of plant granule bound enzymes. The invention further
covers the new polysaccharide wherein the starch synthesis genes are selected from the group
consisting of BEI and BEII.

The present invention broadly encompasses a host containing a transformed glgC gene
and at least one of the starch branching enzyme genes, in combination with at least
one other transformed starch gene, wherein the host produces a polysaccharide product. It also
encompasses a host containing a transformed bacterial gene and at least one of the non starch
branching enzymes selected from the group consisting of debranching enzymes and soluble
starch synthase.

The invention provides a method of producing polysaccharides which are non glycogen like
in a host, comprising transforming a host capable of being used in a fermentation process with
genes selected from the group which produce starch synthesizing enzymes or glycogen
synthesizing enzymes, such that the host produces non glycogen like starch, and employing the
host in a fermentation process that produces polysaccharides. The host is a bacterium, a fungus
or a yeast. Additionally, the method of this invention includes the use of bacterial genes, such
as the glycogen synthesizing genes including the glgC, glgA and glgB genes. In one method
the genes which produce starch synthesizing enzymes include genes encoding starch
soluble synthases I, IIa, IIb and SS III (dull). In another method the genes which produce
starch synthesizing enzymes include genes encoding starch debranching enzyme and
branching enzymes. The invention covers modified starch synthesizing enzymes,
including the N-terminally truncated SS.

In other words, the invention covers a host transformed to carry a gene active in glycogen
production and at least one nonstarch branching gene active in the production of at least one of
the polysaccharides amylopectin and amylose in its original host. The host can be a
monocot or a dicot plant. The host can be a cereal bearing plant. Or the host can be a
bacterium.

More specifically, the invention includes a host wherein the at least one nonstarch branching
gene active in the production of at least one of the polysaccharides amylopectin and
amylose in its original plant is selected from the group consisting of the starch soluble
synthase I, IIa, IIb and III genes, the debranching enzyme gene (su1), the GBSS gene, the sh2
gene and the bt2 gene. The host can also include at least one of the starch branching enzyme
genes, such as the BEI gene or the BEII gene.

The present invention can also be described as a host transformed to carry a gene active
in ADPG production and at least one starch gene active in the production of at least one of the
polysaccharides amylopectin and amylose in its original host, wherein the host
produces polysaccharides that are plant like starch and not glycogen like.

WO 98/44780 PCT/US98/06660

Additionally, the host can be transformed to carry a pyrophosphorylase gene and a
glycogen synthase gene.

The scope of the present invention includes a host deficient in alpha 1,4 glucan
synthesizing ability and alpha 1,4-1,6 branching enzyme capability, transformed to express at
least one plant starch soluble synthase gene. The host can also be
transformed to express at least one gene encoding debranching enzyme and/or a gene
encoding starch soluble synthase I, starch soluble synthase IIa or IIb, or starch soluble
synthase III. This host can also be transformed to express at least one gene
encoding starch branching enzyme.

This invention also includes the production of a glycogen like material in plants.
Attached hereto are a number of plasmids, described by the figures and by Table 1, that are
part of the present invention and are claimed herein. One such example is a plasmid
in a carrier host, wherein the plasmid contains the SSIIa gene with the N-terminus
GENVMNVIW and wherein the gene is approximately 1561 base pairs in length. The
invention includes mutant hosts, for example mutant plants such as waxy rice, potatoes and
corn, and hosts that are a mutant E. coli or fungus.



BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 shows a graph which gives the relative peak area in percent and the chain length
of glycogen and starch soluble synthase I (SSI), starch soluble synthase IIa (SSIIa) and starch
soluble synthase IIb (SSIIb). This shows the specificities of the maize SSs in chain
elongation.

FIG. 2 shows plasmid pEXSC-MBEI with 7661 base pairs and promoter T7 and a 
Kanamycin gene and glgC and the maize starch branching enzyme I (MBEI). 

FIG. 3 shows plasmid pEXSC3C with 7461 base pairs and promoter T7 and ampicillin
gene and the maize starch soluble synthase gene IIa. pEXS3c is the 1082 bp Nde I-EcoRI
fragment containing the N-terminus of MSSSIIa (from MSSIIa in pBSK) subcloned into the
Nde I-EcoRI sites of pEXS3a, replacing the N-terminus of IIa-2 with the longer IIa
N-terminus. MSSIIa is the mature maize SSIIa and is 2090 bp long. The following sites are not
contained in the MSSIIc insert: Apa I, Bgl II, Eco V, Not I, Spe I, and Xba I. The N-terminus of
this plasmid is AEAEAGGKD.

FIG. 4 shows plasmid pEXSC-MBEI-MBEII with 9971 base pairs and promoter T7 and 
a Kanamycin gene and glgC and the maize starch branching enzyme I (MBEI) and the maize 
starch branching enzyme II (MBEII). 

FIG. 5 shows plasmid pEXSC-MBEII with 7521 base pairs and promoter T7 and a 
Kanamycin gene and glgC and the maize starch branching enzyme II (MBEII). 

FIG. 6 shows plasmid pEXSC-3a with 7990 base pairs and promoter T7 and a 
Kanamycin gene and the glgC gene and the maize N-terminally truncated starch synthase gene 
IIa (MSSIIa-2). The N-terminal sequence is GENVMNVI.

FIG. 7 shows plasmid pEXSC-8 with 7079 base pairs and promoter T7 and a 
Kanamycin gene and the glgC gene and the maize starch soluble synthase gene I, version I-2
(MSSI-2), an N-terminally truncated SSI.

FIG. 8 shows plasmid pEXSC-9 with 7551 base pairs and promoter T7 and a 
Kanamycin gene and the glgC gene and the maize starch soluble synthase gene IIb (SSIIb).
The N-terminal sequence is AAAPAGEE. 

FIG. 9 shows plasmid pEXSC-10 with 7211 base pairs and promoter T7 and a 
Kanamycin gene and the glgC gene and the maize starch soluble synthase gene I, the full 
length SSI. The N-terminal sequence is CVAELSREGPA.



FIG. 10 shows plasmid pEXSCA with 6738 base pairs and promoter T7 and a
Kanamycin gene and the glgC gene and the glgA gene.

FIG. 11 shows plasmid pEXSC9a with 7240 base pairs and promoter T7 and ampicillin
gene and the maize starch soluble synthase gene IIb-2 (Maize SS IIb-2), an N-terminally
truncated SSIIb. The N-terminal sequence is MNVVWASECAP.

FIG. 12 shows plasmid pEXSWX with 6968 base pairs and promoter T7 and an
ampicillin gene and the N-terminally-truncated maize WX (maize granule bound starch
synthase). The N-terminal sequence for wx is ASAGMNVVFVGAEMA.

FIG. 13 shows plasmid pEXSWX2 with 6980 base pairs and promoter T7 and an
ampicillin gene and the N-terminally-truncated maize WX termed wx2. The N-terminus of
wx2 is MNVVFVGAEMA.

FIG. 14 shows plasmid pEXSC9 with 7780 base pairs and promoter T7 and ampicillin
gene and E. coli glgC gene and the maize starch soluble synthase gene IIb (Maize SS IIb).

FIG. 15 shows plasmid pEXSC10d with 7112 base pairs and promoter T7 and
ampicillin gene, E. coli glgC gene and the N-terminally-truncated maize starch soluble
synthase gene I (termed Maize SSI-3). The N-terminus of maize SSI-3 is
MSIVFTGEASPYA.

FIG. 16 shows plasmid pEXS10 with 5300 base pairs and promoter T7 and ampicillin
gene and the full length maize starch soluble synthase gene I, termed Maize SS I.

FIG. 17 shows plasmid pEXS8 with 7259 base pairs and promoter T7 and ampicillin 
gene and the N-terminally-truncated maize starch synthase gene I termed as SSI-2. The N- 
terminal sequence is CVAELSRDLGLEPEG. 

FIG. 18 shows plasmid pEXSCA1 with 5128 base pairs and promoter T7 and
ampicillin gene and the glgA gene. pEXSCA1 is a 1551 bp Spe I-Sac I fragment containing glgA
(from glgA in pBSK) subcloned into the Xba I-Sac I sites of pET-23d, which is commercially
available from Novagen in Madison, Wisconsin under catalog number 69748-1 and called
pET-23d(+) DNA.

FIG. 19 shows the spectrum of the iodine glucan complex of the product produced by
the host containing the glgC and glgA genes, and pEXSC9, pEXSC3, pEXSC8 and pEXSCwx. The
X-axis shows wavelength in nm and the Y-axis shows absorbance.

FIG. 20 shows the spectrum of the iodine glucan complex of the product produced by
the host transformed with plasmids containing: glgC, the BEI and BEII genes and glgA;
glgC, the BEI and BEII genes and maize SSI, SSI-2; glgC, the BEI and BEII genes and
maize SSIIb; glgC, the BEI and BEII genes and maize SSIIa-2; and glgC and the BEI and
BEII genes. The X-axis shows wavelength in nm and the Y-axis shows absorbance.

FIG. 21 shows the product produced by the host in small bottles, including the product
from the host containing glgC, the BEI and BEII genes and maize SS genes, encoded as
(C-I-II+8): glgC, the BEI, the BEII and maize SSI-2 genes; pEXSC10, encoded as (C-I-II+10):
glgC, the BEI, the BEII and maize SSI genes; pEXSC9, encoded as (C-I-II+9): glgC, the
BEI, the BEII and maize SSIIb genes; pEXSC3a, encoded as (C-I-II+3a): glgC, the BEI, the
BEII and maize SSIIa-2 genes; pEXSCWX, encoded as (C-I-II+WX): glgC, the BEI, the
BEII and maize waxy genes; pEXSCA1, encoded as (C-I-II+A1): maize BEI,
BEII and E. coli glgA genes; potato dextrin; waxy maize starch; corn amylopectin; rice starch;
corn starch; and pEXSC8.

FIG. 22 shows pEXS-trc, which has 4178 base pairs with the trc promoter and the ampicillin
gene. pEXS-trc is pTrc99A-NdeI, which has been mutagenized. (The Nco I site in the multiple
cloning site of pTrc99A-NdeI is mutagenized to Nde I using primers EXS63 and EXS64.) pEXS-trc
contains only one Nde I site and no Nco I sites. The following sites are not contained in
pEXS-trc: Bgl II, Cla I, Nco I, Not I, Sac II, SnaB I, Spe I, and Xho I.
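
The presence or absence of restriction sites in a plasmid such as pEXS-trc can be checked by scanning the sequence for each recognition sequence. The following Python sketch shows one way this might be done, assuming the plasmid sequence is available as a string and is circular. The recognition sequences are the standard ones for these enzymes; the function names and the short test sequence are hypothetical and do not represent the pEXS-trc sequence.

```python
# Standard recognition sequences; all are palindromic, so scanning one
# strand is sufficient.
SITES = {
    "Bgl II": "AGATCT",
    "Cla I": "ATCGAT",
    "Nco I": "CCATGG",
    "Nde I": "CATATG",
    "Not I": "GCGGCCGC",
    "Sac II": "CCGCGG",
    "SnaB I": "TACGTA",
    "Spe I": "ACTAGT",
    "Xho I": "CTCGAG",
}


def count_sites(plasmid, site):
    """Count occurrences of a site in a circular plasmid sequence."""
    seq = plasmid.upper()
    # Append the first bases so that sites spanning the origin are found.
    wrapped = seq + seq[: len(site) - 1]
    return sum(
        1 for i in range(len(seq)) if wrapped[i : i + len(site)] == site
    )


def absent_sites(plasmid):
    """Return the names of the enzymes whose site does not occur."""
    return sorted(n for n, s in SITES.items() if count_sites(plasmid, s) == 0)


# Hypothetical toy sequence (not pEXS-trc): one Nde I site, no Nco I site.
toy_plasmid = "GGCATATGAATTCGG"
print("Nde I sites:", count_sites(toy_plasmid, SITES["Nde I"]))
print("Absent:", ", ".join(absent_sites(toy_plasmid)))
```

Applied to a real plasmid sequence, a check of this kind would be expected to report exactly one Nde I site and no Nco I site for a construct described as above.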

FIG. 23 shows pEXS-trc3, which has 4129 base pairs with the trc promoter, part of the
ampicillin gene and the Kanamycin gene. pEXS-trc3 is pEXS-trc1 cut with Bgl I (filled in)-
Sca I and religated, deleting most of the Amp gene (304 nt from the 5' end remain). The
following sites are not contained in pEXS-trc3: Apa I, Bgl II, Eco V, Nco I, Not I, SnaB I
and Spe I.

FIG. 24 shows the plasmid pEXS 102 having 7190 base pairs, adapted for plant
transformation containing the maize 10KD zein promoter, and maize adh I intron, the gene
coding for the maize starch synthase I transit peptide, and the Waxy 2 gene and the nos
terminator and the ampicillin gene.

FIG. 25 shows the plasmid pEXS 103 having 6607 base pairs, adapted for plant use 
containing the maize 10KD zein promoter, the gene coding for the maize starch synthase I
transit peptide and the Waxy 2 gene and the nos terminator and the ampicillin gene. 

FIG. 26 shows the plasmid pEXS 101 having 6979 base pairs, adapted for plant use 
containing the maize 10KD zein promoter, the gene coding for the maize starch synthase I 
transit peptide and the glgB gene and the nos terminator and the ampicillin gene.

FIG. 27 shows the plasmid pEXS 100 having 7557 base pairs, adapted for plant use 
containing the maize 10KD zein promoter, and maize adh I intron, the gene coding for the 
maize starch synthase I transit peptide, and the glgB gene and the nos terminator and the
ampicillin gene. 

FIG. 28 shows the plasmid pEXS 101 having 6273 base pairs, adapted for plant use 
containing the maize 10KD zein promoter, the gene coding for the maize starch synthase I 
transit peptide, and the glgA gene and the nos terminator and the ampicillin gene.

FIG. 29 shows the plasmid pEXS 66 having 6001 base pairs, adapted for plant use 
containing the maize 10KD zein promoter, the gene coding for the maize starch synthase I 
transit peptide, and the glgC3 gene and the nos terminator and the ampicillin gene.

FIG. 30 shows the plasmid pEXS 65 having 6373 base pairs, adapted for plant use 
containing the maize 10KD zein promoter, the gene coding for the maize starch synthase I 
transit peptide, and the maize waxy gene and the nos terminator and the ampicillin gene. 

FIG. 31 shows the plasmid pEXS 64 having 7073 base pairs, adapted for plant use 
containing the maize 10KD zein promoter, the gene coding for the maize starch synthase I
transit peptide, and the maize soluble starch synthase IIa gene and the nos terminator and the
ampicillin gene. 

FIG. 32 shows the plasmid pEXS 63 having 6473 base pairs, adapted for plant use 
containing the maize 10KD zein promoter, the gene coding for the maize starch synthase I 
transit peptide, and the maize soluble starch synthase IIa gene and the nos terminator and the
ampicillin gene. 

FIG. 33 shows the plasmid pEXS 62 having 6773 base pairs, adapted for plant use 
containing the maize 10KD zein promoter, the gene coding for the maize starch synthase I
transit peptide, and the maize soluble starch synthase I-2 gene and the nos terminator and the
ampicillin gene.

FIG. 34 shows the plasmid pEXS 61 having 7013 base pairs, adapted for plant use 
containing the maize 10KD zein promoter, the gene coding for the maize starch synthase I 
transit peptide, and the maize soluble starch synthase IIb gene and the nos terminator and the
ampicillin gene. 

FIG. 35 shows the plasmid pEXS 59 having 6858 base pairs, adapted for plant use 
containing the maize 10KD zein promoter, and maize adh I intron, the gene coding for the 
maize starch synthase I transit peptide, and the E.coli glgA gene and the nos terminator and the 
ampicillin gene.

FIG. 36 shows the plasmid pEXS 58 having 7658 base pairs, adapted for plant use 
containing the maize 10KD zein promoter, and maize adh I intron, the gene coding for the 
maize starch synthase I transit peptide, and the maize soluble starch synthase IIa gene and the
nos terminator and the ampicillin gene. 

FIG. 37 shows the plasmid pEXS 56 having 6586 base pairs, adapted for plant use
containing the maize 10KD zein promoter, and maize adh I intron, the gene coding for the
maize starch synthase I transit peptide, and the glgC3 gene and the nos terminator and the
ampicillin gene.

FIG. 38 shows the plasmid pEXS 54 having 7658 base pairs, adapted for plant use
containing the maize 10KD zein promoter, and maize adh I intron, the gene coding for the
maize starch synthase I transit peptide, and the Maize SS IIa gene and the nos terminator and
the ampicillin gene.

FIG. 39 shows the plasmid pEXS 53 having 7058 base pairs, adapted for plant use 
containing the maize 10KD zein promoter, and maize adh I intron, the gene coding for the 
maize starch synthase I transit peptide, and the maize starch soluble synthase IIa-2 gene and the 
nos terminator and the ampicillin gene. 

FIG. 40 shows the plasmid pEXS 52 having 7358 base pairs, adapted for plant use 
containing the maize 10KD zein promoter, and maize adh I intron, the gene coding for the 
maize starch synthase I transit peptide, and maize starch soluble synthase I-2 gene and the nos
terminator and the ampicillin gene. 

FIG. 41 shows the plasmid pEXS 51 having 7398 base pairs, adapted for plant use
containing the maize 10KD zein promoter, and maize adh I intron, the gene coding for the
maize starch synthase I transit peptide, and maize starch soluble synthase IIb gene and the nos
terminator and the ampicillin gene. 

FIG. 42 shows a photograph of eleven products of altered starch produced with the
present invention. The titles are encoded as follows: C-I-II denotes the glgC gene and the BEI
and BEII genes, and the number or alternative designation that follows denotes the pEXS
plasmid of that number. Thus C-I-II+10 denotes the glgC gene, the BEI and BEII genes and the
pEXS-10 plasmid, which contains the SSI gene having the N-terminus shown in Table 1.

FIG. 43 shows the DNA sequence and the protein sequence for glgA having 1488 base
pairs.

FIG. 44 shows the DNA sequence and the protein sequence for glgB having 2361 base
pairs.






FIG. 45a shows the DNA sequence for Zea mays 10-kDa zein gene having 2562 base
pairs.

FIG. 45b shows the DNA sequence for Zea mays 10-kDa zein portion of the gene used 
as the promoter in a number of the plasmids discussed herein. 

FIG. 46 shows the DNA sequence and the protein sequence for glgC3 having
1328 base pairs containing two mutations, P295D and K296E. This is a mutant of the wild type
glgC gene.
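
The notation P295D, K296E denotes replacement of proline at position 295 by aspartate and of lysine at position 296 by glutamate. The following Python sketch illustrates how such substitutions might be applied to a protein sequence held as a string, with a check that the expected wild type residue is present at each position. The function name and the placeholder sequence are hypothetical; the placeholder is not the glgC protein sequence.

```python
import re


def apply_substitutions(protein, substitutions):
    """Apply substitutions written as <old residue><position><new residue>.

    Positions are 1-based. A ValueError is raised if the residue found at
    a position differs from the stated wild type residue.
    """
    residues = list(protein)
    for sub in substitutions:
        match = re.fullmatch(r"([A-Z])(\d+)([A-Z])", sub)
        if match is None:
            raise ValueError(f"cannot parse substitution {sub!r}")
        old, pos, new = match.group(1), int(match.group(2)), match.group(3)
        if pos < 1 or pos > len(residues) or residues[pos - 1] != old:
            raise ValueError(f"residue {pos} is not {old}")
        residues[pos - 1] = new
    return "".join(residues)


# Hypothetical placeholder (not glgC): alanines with P at 295 and K at 296.
placeholder_wt = "A" * 294 + "PK" + "A" * 4
placeholder_mut = apply_substitutions(placeholder_wt, ["P295D", "K296E"])
print(placeholder_wt[292:298], "->", placeholder_mut[292:298])
```

Checking the wild type residue before substituting guards against off-by-one numbering errors, for example when a sequence includes or omits an initiator methionine.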

FIG. 47 shows the DNA sequence and the protein sequence for glgC (glgC) having 
1328 base pairs. 

FIG. 48 shows the DNA sequence and the protein sequence for glgCwt (glgCwt) having 
1328 base pairs. This is the glgC gene that is found in nature. 

FIG. 49 shows the DNA sequence and the protein sequence for the maize waxy gene 
denoted wx herein. 

FIG. 50 shows the DNA sequence and the protein sequence for the maize starch soluble 
synthase IIb encoding gene having 2423 base pairs.

FIG. 51 shows the DNA sequence and the protein sequence for the maize starch soluble 
synthase IIa.

FIG. 52 shows the DNA sequence and the protein sequence for the maize starch soluble 
synthase I-2 having 1749 base pairs.

FIG. 53 shows the DNA sequence and the protein sequence for the maize branching
enzyme II.

FIG. 54 shows the DNA sequence and the protein sequence for the maize branching
enzyme I.

FIG. 55 shows the DNA sequence and the protein sequence (153) for the transit peptide 
portion of the maize starch soluble synthase I. 

FIG. 56. PCR analysis of transgenic rice plants. The genomic DNA isolated from rice
plants was PCR amplified using specific primers for the inserted gene. The specific bands
were identified on a 1% agarose gel by comparison with a non-transgenic rice plant.

FIG. 57. Activity staining of starch synthase on renaturing SDS-PAGE gel with Iodine 
solution: The positive staining of maize SSI-2 indicated the expression of maize SSI-2 in 
transgenic rice plants. 

FIG. 58. SSI-1, SSI-2, and SSI-3 construct diagram. Three forms of SSI were
constructed in the pET expression system (see Methods). pExs10a encodes SSI-1, the full
length maize SSI (583 amino acids). pExs8 encodes a truncated SSI, SSI-2, with amino acids
#8-52 deleted from the N-terminus of SSI-1. pExs1d encodes the most truncated form of SSI,
SSI-3, with the first 93 amino acids deleted from SSI-1. A depiction of the waxy gene,
encoding GBSS, is also included for comparison. The amino acid motif KS/TGGL, the
putative binding site for ADPGlc, is indicated by the triangles. The KS/TGGL motif is located
18 amino acids from the N-terminus in GBSS, while the motif is 106 amino acids from the
N-terminus in maize SSI. Drawing not to scale.

FIG. 59. SSIIa-1 and SSIIa-2 construct diagram. Two forms of SSIIa were constructed
in the pET expression system. pExs3c encodes SSIIa-1, the putative full length maize SSIIa.
N-terminal sequencing of SSIIa-1 revealed that the polypeptide chain started at amino acid #1,
so the length of SSIIa-1 is 669 amino acids. pExs3a encodes a truncated form of SSIIa,
SSIIa-2, with the first 176 N-terminal amino acids deleted from SSIIa (493 amino acids total). A
depiction of the waxy gene, encoding GBSS, is also included for comparison. The amino acid
motif KTGGL, the putative binding site for ADPGlc, is indicated by the triangles. The
KTGGL motif is located 18 amino acids from the N-terminus in GBSS, while the motif is 194
amino acids from the N-terminus in maize SSIIa.



FIG. 60. SSIIb-1 and SSIIb-2 construct diagram. Two forms of SSIIb were constructed
in the pET expression system (see Methods). pExs9 encodes SSIIb-1, the putative full length
maize SSIIb. N-terminal sequencing of SSIIb-1 revealed that the polypeptide chain started at
amino acid #1, so the length of SSIIb-1 is 637 amino acids. pExs9a encodes a truncated form
of SSIIb, SSIIb-2, with the first 144 N-terminal amino acids deleted from SSIIb (492 amino
acids total). A depiction of the waxy gene, encoding GBSS, is also included for comparison.
The amino acid motif KTGGL, the putative binding site for ADPGlc, is indicated by the
triangles. The KTGGL motif is located 18 amino acids from the N-terminus in GBSS, while
the motif is 158 amino acids from the N-terminus in maize SSIIb.
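
The construct arithmetic in the captions of FIGS. 58-60 can be sketched in a few lines of Python. This is only an illustrative sketch: the dictionary and function names are hypothetical, the numbers are transcribed from the captions, and the calculation is plain subtraction that ignores any residue added or removed by the expression vector. Under that assumption the subtraction gives 493 residues for SSIIb-2, whereas the caption of FIG. 60 states 492; the sketch does not attempt to resolve that one-residue difference.

```python
# Values transcribed from the captions of FIGS. 58-60. Positions are counted
# in amino acids from the N-terminus of the full-length protein.
FULL_LENGTH = {"SSI": 583, "SSIIa": 669, "SSIIb": 637}
MOTIF_POSITION = {"SSI": 106, "SSIIa": 194, "SSIIb": 158, "GBSS": 18}

# Truncated construct -> (parent protein, number of N-terminal residues removed).
N_TERMINAL_DELETION = {
    "SSI-3": ("SSI", 93),
    "SSIIa-2": ("SSIIa", 176),
    "SSIIb-2": ("SSIIb", 144),
}


def truncated_length(construct):
    """Length left after removing the stated N-terminal residues (assumption: plain subtraction)."""
    parent, deleted = N_TERMINAL_DELETION[construct]
    return FULL_LENGTH[parent] - deleted


def motif_position_after_truncation(construct):
    """Distance of the KS/TGGL motif from the new N-terminus (assumption: plain subtraction)."""
    parent, deleted = N_TERMINAL_DELETION[construct]
    return MOTIF_POSITION[parent] - deleted


if __name__ == "__main__":
    for name in N_TERMINAL_DELETION:
        print(name, truncated_length(name), motif_position_after_truncation(name))
```

On these figures the truncation in SSIIa-2 leaves the KTGGL motif 18 residues from the new N-terminus, the same distance stated for GBSS.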



FIG. 61. Temperature Curves for SSI enzymes. All assay components, except enzyme
and [U-14C]-ADPGlc, were mixed and then preincubated at each temperature for 3 minutes
before addition of enzyme and ADPGlc. For all assays, the final concentration of
[U-14C]-ADPGlc was 3 mM, while amylopectin was 6 mg/ml. Each point is an average of three
separate determinations.

FIG. 62. Temperature Optima of SSIIa-1 and SSIIa-2. All assay components, except
enzyme and [U-14C]-ADPGlc, were mixed and then preincubated at each temperature for 3
minutes before addition of enzyme and ADPGlc. For assays in the presence of 0.5 M citrate, 5
mg/ml amylopectin was used as primer. For assays without citrate, 10 mg/ml amylopectin was
used. For all assays, the concentration of [U-14C]-ADPGlc was 3 mM. Each point is an
average of three separate determinations.



FIG. 63. Temperature Optima of SSIIb-1 and SSIIb-2. All assay components, except
enzyme and [U-14C]ADPGlc, were mixed and then preincubated at each temperature for 3
minutes before addition of enzyme and ADPGlc. For all assays, the concentration of
[U-14C]ADPGlc was 3 mM and the concentration of glycogen was 40 mg/ml. Each point is an
average of three separate determinations.






PREFERRED EMBODIMENT - DESCRIPTION 

Gene shall mean the entire gene sequence or any mutations or varieties of the codon
that produce the desired activity in the host or alternatively the section or sections of the gene
sequence necessary to produce the desired activity in the host. For example glgC gene shall
mean glgC16, glgC3 and other mutants that produce the desired activity in the host. Starch
synthase gene shall mean full length SS, N-terminally truncated SS or mutated SS with
starch synthase activity.

Glycogen like shall mean polysaccharide material such as that produced as the main
starch product by E. coli in its native state and by the hosts as taught in the above described
paper by Hanping Guan.

Non glycogen like shall mean polysaccharide material which is plant like and is not
produced as the main starch product by E. coli in its native state or by the hosts as taught in
the above described paper by Hanping Guan.

Plant like starch is non glycogen like.

Transformed gene shall mean a gene that was, somewhere in the lineage of the plant or
bacteria, introduced by means other than nature. Thus the progeny of a
transformed host would continue to contain a transformed gene.

Transformed host shall mean any organism containing one or more of the novel
plasmids and/or a novel combination of starch synthetic genes discussed herein.

Within this application a number of different protocols have been employed to designate the
same gene or synthase. MSS# = maize soluble starch synthase; SS# will likewise mean starch
synthase, though not necessarily maize. STS# will also designate soluble starch synthase.
GBSS = granule bound starch synthase. SBE# = starch branching enzyme, MBE = maize starch
branching enzyme, MSBE# = maize starch branching enzyme, and BE# = starch branching
enzyme.

The present invention broadly encompasses transforming hosts such as bacteria or
plants with plant starch synthetic genes that produce a non glycogen like material (a bacterium
containing BEI and BEII from maize produces a glycogen like material). Starch bearing plants
and organisms are hereinafter referred to as the host. One of the primary aspects of this
invention is the generation of plant like starch from a bacterial host and the production of
altered starch in a plant host. The present invention has been exemplified in both bacteria and
in transformed rice plants. The host can contain, though it is not a limitation, an unlimited
supply of ADPG from the addition of the glgC gene (the bacterial gene) to the plant.
Additionally the present invention encompasses plasmids that contain the maize genes and/or
the bacterial genes in a construct adapted for use in a bacteria and constructs adapted for use in
a plant. The plasmids in the plant construct preferably contain an active promoter
recognized by the plant, a transit peptide, and the cleavage site that permits the protein to
cleave from the transit peptide when crossing into the amyloplast in the plant. The plasmids
used in the rice transformation specifically encompassed the maize 10 kDa zein promoter and
the transit peptide from the maize SSI gene in the constructs adapted for plant use. The present
invention also encompasses the plant producing the altered starch in the starch storage section
of the plant or within the host cell, and the altered starch itself. Additionally the present
invention encompasses a number of starch genes in combination being
active in a host such that the host produces differing non glycogen polysaccharides. Still
further the present invention encompasses a method of making plant like starch in a bacterial
host and the method of making altered plant like starch (altered in relationship to the type or
amount of starch that the host makes without the constructs containing the genes) in a plant.
Yet another object of the present invention is the addition of a gene that encodes an enzyme
producing the substrate ADPG used to form starch.

The present invention encompasses a plasmid or combination of plasmids in the same
host having a promoter adapted for use in a plant and a gene encoding for ADPGlc
pyrophosphorylase, preferably a bacterial gene, and a gene encoding for starch synthase I or
its mutant form. The present invention also encompasses the combination of a promoter
adapted for use in a plant and optionally a gene encoding for ADPGlc pyrophosphorylase,
preferably a bacterial gene, and a gene encoding for starch synthase I or its mutant form, and at
least one gene encoding for branching enzyme transformed into a plant host.

The present invention encompasses a plasmid or combination of plasmids in the same
host having a promoter adapted for use in a plant and a gene encoding for ADPGlc
pyrophosphorylase, preferably a bacterial gene, and a gene encoding for starch synthase IIa or
its mutant form. The present invention also encompasses the combination of a promoter
adapted for use in a plant and optionally a gene encoding for ADPGlc pyrophosphorylase, and
a gene encoding for starch synthase IIa or its mutant form, and at least one gene encoding for
branching enzyme transformed into a plant host.



The present invention encompasses a plasmid having a promoter adapted for use in a
plant and a gene encoding for ADPGlc pyrophosphorylase, preferably a bacterial gene, and a
gene encoding for starch synthase IIb or its mutant form. The present invention also
encompasses the combination of a promoter adapted for use in a plant and an optional gene
encoding for ADPGlc pyrophosphorylase, and a gene encoding for starch synthase IIb or its
mutant form, and at least one gene encoding for branching enzyme transformed into a plant
host.

The present invention encompasses a plasmid having a promoter adapted for use in a
plant and a gene encoding for ADPGlc pyrophosphorylase, preferably a bacterial gene, and genes
encoding for at least one of the following genes: starch synthase I, starch synthase IIa, starch
synthase IIb, DU1. The present invention also encompasses the combination of a promoter
adapted for use in a plant and a gene encoding for ADPGlc pyrophosphorylase, preferably a
bacterial gene, and genes encoding for at least one of the following genes: starch synthase I,
starch synthase IIa, starch synthase IIb and DU1, and at least one gene encoding for branching
enzyme transformed into a plant host.



The present invention encompasses a plasmid or combination of plasmids in the same
host having a promoter adapted for use in a plant and a gene encoding for ADPGlc
pyrophosphorylase, preferably a bacterial gene, and genes encoding for at least one of the
following genes: starch synthase I, starch synthase IIa, IIb and starch synthase III (DU1). The
present invention also encompasses the combination of a promoter adapted for use in a plant
and an optional gene encoding for ADPGlc pyrophosphorylase, preferably a bacterial gene, and
genes encoding for at least one of the following genes: starch synthase I, starch synthase IIa, IIb,
starch synthase III (DU1), and at least one gene encoding for branching enzyme, and at least
one gene encoding for the debranching enzyme transformed into a plant host.

The present invention encompasses a plasmid or combination of plasmids in the host
having a promoter adapted for use in a bacteria or yeast and a gene encoding for ADPGlc
pyrophosphorylase, preferably a bacterial gene, and a gene encoding for starch synthase I.
The present invention also encompasses the combination of a promoter adapted for use in a
bacteria or yeast and a gene encoding for ADPGlc pyrophosphorylase, preferably a bacterial
gene, and a gene encoding for starch synthase I, and at least one gene encoding for branching
enzyme transformed into a bacteria or yeast host.

The present invention encompasses a plasmid or combination of plasmids in the host
having a promoter adapted for use in a bacteria or yeast and a gene encoding for ADPGlc
pyrophosphorylase, preferably a bacterial gene, and a gene encoding for starch synthase IIa.
The present invention also encompasses the combination of a promoter adapted for use in a
bacteria or yeast and optionally a gene encoding for ADPGlc pyrophosphorylase, preferably a
bacterial gene, and a gene encoding for starch synthase IIa, and at least one gene encoding for
branching enzyme transformed into a bacteria or yeast host.

The present invention encompasses a plasmid or combination of plasmids in the same
host having a promoter adapted for use in a bacteria or yeast, and a maize gene encoding for
starch synthase III (DU1). The present invention also encompasses the combination of a
promoter adapted for use in a bacteria or yeast and an optional gene encoding for ADPGlc
pyrophosphorylase, preferably a bacterial gene, and a gene encoding for starch synthase III,
and at least one gene encoding for branching enzyme transformed into a bacteria or yeast host.

The present invention encompasses a plasmid or combination of plasmids in the same
host having a promoter adapted for use in bacteria [...]
for at least one of the following genes: starch synthase I, starch synthase IIa, IIb, starch synthase
III (DU1). The present invention also encompasses the combination of a promoter adapted for
use in bacteria or in yeast and a gene encoding for ADPGlc pyrophosphorylase, preferably a
bacterial gene, and genes encoding for at least one of the following genes: starch synthase I,
starch synthase IIa, IIb, starch synthase III, and at least one gene encoding for branching
enzyme transformed into bacteria or into yeast hosts.

The present invention encompasses a plasmid or combination of plasmids in the same
host having a promoter adapted for use in bacteria or in yeast and a gene encoding for ADPGlc
pyrophosphorylase, preferably a bacterial gene, and genes encoding for at least one of the
following genes: starch synthase I, starch synthase IIa, IIb, starch synthase III. The present
invention also encompasses the combination of a promoter adapted for use in bacteria or in
yeast and a gene encoding for ADPGlc pyrophosphorylase, preferably a bacterial gene, and genes
encoding for at least one of the following genes: starch synthase I, starch synthase IIa, IIb, starch
synthase III (DU1), and at least one gene encoding for branching enzyme, and at least one gene
encoding for the debranching enzyme transformed into a bacteria or into a yeast host.

The present invention encompasses the truncated versions of the SSI and the SSII and
the SSIII genes that still provide protein that is sufficient to make the polysaccharide.

By transforming different combinations of SS and SBE into E. coli HPG204(DE3) or G6MD3,
which are defective in GS and GBE, we obtained the first evidence that maize SSI, SSII and SSIII
have different specificities in the size of glucans synthesized (see FIG. 1). Herein, we present the
model system to produce differing polysaccharides from hosts with SS and SBE in E. coli by
metabolic engineering. We also demonstrated that the truncated forms of SS had different
Vmax, temperature stability and kinetic properties (Table, Fig).

We also demonstrated that transformation of starch synthase and/or branching enzyme
in E. coli resulted in production of polysaccharides differing in size and structure. These
polysaccharides can be used in food and nonfood industries to replace and/or complement
starch functionalities. A large amount of these polysaccharides can be produced with
fermentation technology.

Starch biosynthesis in higher plants and glycogen biosynthesis in E. coli have similar
reactions which use adenosine diphosphate glucose (ADPGlc) as a substrate. This similarity
allows us to use plant starch synthase (SS) and starch branching enzyme (SBE) to complement
the functions of glycogen synthase (GS) and glycogen branching enzyme (GBE) in E. coli
G6MD3, which is deficient in GS and GBE. Transformation of the E. coli glgC gene and a maize
starch synthase gene into E. coli G6MD3 produced a linear α-1,4 glucan similar to amylose.
Coexpression of glgC, maize starch synthase and maize branching enzyme produced
branched polysaccharides. However, distinct properties of plant starch branching enzyme and
starch synthase make it possible to synthesize different polysaccharides in E. coli. While maize
SSI preferentially synthesizes short chains (dp 6 - 15), SSII and SSIII preferentially transfer
long chains (dp > 24) and intermediate chains (dp 16 - 24) respectively. Transformation of
different maize starch synthases, E. coli glycogen synthase (glgA) and/or maize branching
enzymes into E. coli HPG96 or E. coli G6MD3 resulted in the synthesis of different sizes of
polysaccharide with DP 500-4000. These polysaccharides synthesized in E. coli by maize SS
have different physical-chemical properties than polysaccharides synthesized in natural
organisms, including starch from plant sources and glycogen from animals. The polysaccharides
can be used in food and nonfood industries to replace and/or complement starch
functionalities. A large amount of these polysaccharides can be produced by fermentation
technology.

The following materials were employed in the construction of the present
invention. Some of the starting materials are commercially available from Novagen in Madison,
Wisconsin: pET-23d(+) DNA under catalog number 69748-1; BL21(DE3) under catalog
number 69387-1; and pET-21a(+) DNA under catalog number 69740-1.
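
The chain-length preferences stated above (SSI for dp 6 - 15, SSIII for dp 16 - 24, SSII for dp > 24) can be expressed as a small classifier. The following Python sketch is illustrative only: the function and variable names are hypothetical, and treating chains shorter than dp 6 as unclassified is an assumption, since the text does not assign them to any synthase.

```python
from collections import Counter

# Synthase reported in the text to prefer each chain class.
PREFERRED_SYNTHASE = {"short": "SSI", "intermediate": "SSIII", "long": "SSII"}


def chain_class(dp):
    """Classify a glucan chain by degree of polymerization (dp)."""
    if dp < 6:
        return None  # assumption: below the ranges given in the text
    if dp <= 15:
        return "short"
    if dp <= 24:
        return "intermediate"
    return "long"


def tally_chain_classes(dps):
    """Count how many chains of a (hypothetical) distribution fall in each class."""
    return Counter(c for c in (chain_class(dp) for dp in dps) if c is not None)


if __name__ == "__main__":
    example = [6, 15, 16, 30]  # made-up chain lengths, for illustration only
    for cls, count in tally_chain_classes(example).items():
        print(cls, count, PREFERRED_SYNTHASE[cls])
```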

Plant Hosts 

The following plasmids have been transformed into rice plants: transgenic 1,
MSTSIA (pExs52) and glgC3 (pExs66); MSTSIIa and glgC3 (pExs53 and pExs56). The second
group of rice transformants contains MSTSIIc and glgC3 (pExs54 and pExs56). The third group
of transformations: transgenic 5, MSTSIII and glgC3 (pExs61 and pExs66); transgenic 6, Mwx
and glgC3 (pExs65 and pExs66). Generally see figures 25-41 for plasmid maps and figures 43-55
for sequences used in the plasmids. Additionally, glgA and glgB and glgC were combined and
transformed into rice. This combines the rice plant's starch pathway with the gene
encoding for ADPG and the genes encoding for at least one of the following enzymes: SSI,
SSII, SSIII, debranching enzymes, BEI, BEII, GBSS (wx).

These plasmids could have been transformed into other cereals such as corn, wheat,
barley, oats, sorghum, milo in substantially the form that is shown in the figures for the
plant host. The promoter could be that of the waxy gene, which is published; other additional zein
promoters are known and could be used as the promoter. The promoter used herein is
described in Figures 45a and 45b.

Additionally these plasmids with little additional work could be transformed into dicots
such as potatoes, sweet potato, taro, yam, lotus, cassava, peanuts, peas, soybean, beans,
chickpeas. The promoter could be selected to target the starch storage area of the particular
dicots (some are roots, some are tubers). Various methods of transforming monocots and dicots
are known in the industry and the method of transforming the genes is not critical to the present
invention. The plasmid can be introduced into Agrobacterium tumefaciens by the freeze-thaw
method of An et al. (1988) Binary vectors. In Plant Molecular Biology Manual A3, S.B. Gelvin
and R.A. Schilperoort, eds (Dordrecht, The Netherlands: Kluwer Academic Publishers), pp. 1-19.
Preparation of Agrobacterium inoculum carrying the construct and inoculation of plant
material, regeneration of shoots, and rooting of shoots are described in Edwards et al. (1995).
Biochemical and molecular characterization of a novel starch synthase from potatoes. Plant J.
8, 283-294. Additionally promoters for different dicots are known, particularly CaMV 35S, and
Monsanto has also published a promoter that is useful in potatoes called a patatin promoter.

A number of monocots are also starch bearing plants but until about a decade ago
transformants of monocots were difficult to develop. The most prominent methods of
transformation presently used in monocots are the gunning of micro projectiles into the plants or
the use of Agrobacterium, with subsequent regeneration of the plants from the transformed
materials. Various tissues and cells of monocot hosts can now be transformed with plasmids. In
fact there are teachings from at least five years ago on methods of transforming not only callus
but also cotyledons. The methods of transforming plants and selecting for the transformants with
either selectable or screenable markers are also well known. The use of the marker in the same
plasmid and the use of the markers in a separate plasmid that is co-transformed into the host are
well known to those of ordinary skill in the art. The biotechnology methods of
forming plasmids and transforming plants are listed in the book entitled "Short Protocols In
Molecular Biology", 3rd ed., published in 1995 by John Wiley & Sons, Inc. Additionally,
methods of transforming with the gun and with protoplasts are taught in a number of issued
patents to Dekalb and Agracetus and Ciba.

PREFERRED EMBODIMENT - OPERATION 

EXAMPLE 1 
Construction of the E. coli expression vector. 

The expression vector pExs2 was derived from pET-23d (Novagen) and pGP1-2 (15).
The expression vectors pExs-trc and pExs-trc3 were derived from pTrc99a (Pharmacia) and
pGP1-2. The BglII/PstI fragment (2192 bp) containing the pBR322 origin of replication was
deleted from pET-23d and replaced with the BamHI/PstI fragment (3 kb) containing origin
p15A and kanamycin resistance gene from pGP1-2. This process generated plasmid pEXS1
containing both ampicillin and kanamycin resistance genes. The ampicillin resistance gene was
inactivated by deletion of the ScaI / BglI fragment (360 bp, BglI end was filled in and
blunt-end ligated with ScaI end). Inactivation of the ampicillin resistance gene in pEXS1 generated
the expression plasmid pEXS2, containing the T7 promoter, T7 terminator, kanamycin
resistance gene and p15A origin of replication. Plasmid pTrc99a was digested with NdeI, filled
in with Klenow fragment and blunt-end ligated to remove the NdeI site. An NdeI site was
introduced at the NcoI site by mutagenesis to generate plasmid pExs-trc. The BglI and PvuII
fragment (2.48 kb) in pExs-trc containing the pBR322 origin of replication was replaced by the
BglI/BamHI (filled in with Klenow fragment) fragment (3 kb) containing origin p15A and
kanamycin resistance gene from pGP1-2 to generate pExs-trc2. The ampicillin resistance gene
was inactivated by deletion of the ScaI / BglI fragment (360 bp, BglI end was filled in and
blunt-end ligated with ScaI end). Inactivation of the ampicillin resistance gene in pExs-trc2
generated the expression plasmid pExs-trc3.
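
The size bookkeeping implied by the construction of pEXS2 can be sketched as follows. This Python fragment is only an illustration: the names are hypothetical, the 3 kb fragment is taken as exactly 3000 bp although the text gives it only approximately, and the few bases added by filling in the BglI end are ignored. The starting size of pET-23d is not stated in the text, so it is left as a parameter.

```python
# Fragment sizes as stated in Example 1 (negative = deleted, positive = inserted).
PEXS2_STEPS = [
    ("delete BglII/PstI fragment carrying the pBR322 origin", -2192),
    ("insert BamHI/PstI fragment with p15A origin and kanamycin resistance", 3000),  # "3 kb", approximate
    ("delete ScaI/BglI fragment of the ampicillin resistance gene", -360),
]


def net_size_change(steps):
    """Sum the size changes (in bp) over all construction steps."""
    return sum(delta for _, delta in steps)


def final_size(start_bp, steps):
    """Approximate plasmid size after all steps, given a starting size in bp."""
    return start_bp + net_size_change(steps)


if __name__ == "__main__":
    print("net change relative to pET-23d (bp):", net_size_change(PEXS2_STEPS))
```

Under these assumptions pEXS2 is roughly 450 bp larger than the pET-23d it was derived from.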

Construction of expression plasmids for maize SS. For expression of maize SS in E.
coli, the PCR method was used to modify the N-terminus of maize SS using the following
nucleotides: primer Exs4 (5'-CAAGAATGCTGCGGGAGTC-3'), primer Exs23
(5'-AAGTCGACATATGTGCGTCGCGGAGCTGAGCAG-3'), primer Exs57
(5'-GGGCCCCATATGAGCATTGTCTTTGTAACCGG-3'), primer Exs1
(5'-CTCGGGCCCATATGGGGGAGAATGTTATGAA-3'), primer Exs2
(5'-GAGGCATCAATGAACACAAAGTCG-3'), primer Exs33
(5'-GAAGGGCCCCATATGGCTGAGGCTGAGGCCGGGGGCAAG-3'), primer Exs16
(5'-TTGGATCCATATGGGAGCTGCGGTTGCATTGGG-3'), primer Exs17
(5'-CCTGCGGGCTCTGGCTTCACC-3'), primer Exs55
(5'-TTGGATCCATATGAACGTCGTCGTGGTGGCTTC-3'), primer 56
(5'-GCATACCATGGAACCTCAACAGC-3'), primer 53
(5'-GGTACCATATGAACGTCGTCTTCGGCG-3'), primer Exs54
(5'-GACAGGCCCGTAGATCTTCTCC-3'), and primer Exs-wx
(5'-TTGGTACCATATGGCCAGCGCCGCCGGCATGAACG-3'). Primer Exs4 paired
individually with primers Exs23 and Exs57 was used to modify the N-terminus of the maize SSSI
gene to generate pExs-10 and pExs-1d. Primer Exs2 paired individually with primers Exs33 and
Exs1 was used to modify the N-terminus of maize SSSII to generate pExs3c and pExs3a. Primer
Exs17 paired individually with primers Exs16 and Exs55 was used to modify the N-terminus of
maize SSSIII to generate pExs-9 and pExs-9a. Primer Exs54 paired individually with primers
Exs-wx and Exs53 was used to modify the N-terminus of maize GBSS to generate pExs-wx and
pExs-wx2. The modified N-terminus was recombined with the rest of the SS gene in the
pBluescript SK plasmid. The reconstructed maize SS gene was subcloned from pBluescript SK to
the NdeI/NotI sites of the expression vectors pET-21a (Novagen), pExs-trc and pExs-trc3 (maps
are attached; Table I shows the N-terminal sequence of SSS).
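
Because the reconstructed genes are subcloned at the NdeI/NotI sites, it can be useful to check which of the listed primers carry the NdeI recognition sequence (CATATG, whose ATG can supply the start codon). The Python sketch below does this for the primers named in the pairing sentences above. The grouping into "N-terminal" and "shared" primers is inferred from those pairings and is an assumption, as are all function and variable names; primer 53 is keyed here as Exs53, following the pairing sentence.

```python
NDEI_SITE = "CATATG"  # NdeI recognition sequence

# Primers that each define a new N-terminus (grouping inferred from the pairings).
N_TERMINAL_PRIMERS = {
    "Exs23": "AAGTCGACATATGTGCGTCGCGGAGCTGAGCAG",
    "Exs57": "GGGCCCCATATGAGCATTGTCTTTGTAACCGG",
    "Exs33": "GAAGGGCCCCATATGGCTGAGGCTGAGGCCGGGGGCAAG",
    "Exs1": "CTCGGGCCCATATGGGGGAGAATGTTATGAA",
    "Exs16": "TTGGATCCATATGGGAGCTGCGGTTGCATTGGG",
    "Exs55": "TTGGATCCATATGAACGTCGTCGTGGTGGCTTC",
    "Exs-wx": "TTGGTACCATATGGCCAGCGCCGCCGGCATGAACG",
    "Exs53": "GGTACCATATGAACGTCGTCTTCGGCG",
}

# Primers shared between the two constructs of each gene.
SHARED_PRIMERS = {
    "Exs4": "CAAGAATGCTGCGGGAGTC",
    "Exs2": "GAGGCATCAATGAACACAAAGTCG",
    "Exs17": "CCTGCGGGCTCTGGCTTCACC",
    "Exs54": "GACAGGCCCGTAGATCTTCTCC",
}


def ndei_offset(sequence):
    """Return the 0-based index of the NdeI site, or -1 if it is absent."""
    return sequence.find(NDEI_SITE)


def sequence_from_start_codon(sequence):
    """Return the primer sequence beginning at the ATG inside the NdeI site."""
    offset = ndei_offset(sequence)
    if offset < 0:
        raise ValueError("primer has no NdeI site")
    return sequence[offset + 3:]  # CAT|ATG: the ATG starts 3 bases into the site


if __name__ == "__main__":
    for name, seq in N_TERMINAL_PRIMERS.items():
        print(name, ndei_offset(seq), sequence_from_start_codon(seq))
```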

EXAMPLE 2 

Construction of expression plasmids for E. coli ADPGlc pyrophosphorylase, BE
and maize SBE.

The E. coli glgB gene was excised from plasmid pOP12 (16). The BstXI (filled in) / HindII
fragment containing the glgB ribosome binding site and the full length glgB gene was cloned at
the SmaI site of pBluescript SK- (Stratagene). The glgB gene in pBluescript SK- was
subsequently cloned into pEXS2 at the XbaI / SalI sites to generate plasmid pEXSB. Primer G
(5'-GAAGATCTGGCAGGGACCTGCACAC-3') and primer H
(5'-GGACTAGTGCATTATCGCTCCTGTTTAT-3') were used to PCR the E. coli glgC gene
coding for ADPGlc pyrophosphorylase from plasmid pOP12. A BglII site and a SpeI site,
introduced by PCR to the N-terminal and C-terminal site respectively, were used to clone the
PCR product into pBluescript SK- at the BamHI and SpeI sites. The glgC gene including its
own ribosome binding site was subcloned into expression plasmid pEXS2 at the XbaI (filled in
with Klenow fragment) and NotI site to generate plasmid pEXSc. The genes coding for mature
maize SBEI and SBEII along with a ribosome binding site were subcloned from plasmids
pET-23d-SBEI and pET-23d-SBEII into the plasmid pEXSc at the SpeI site to form the plasmids
pEXSc-SBEI and pEXSc-SBEII. The gene coding for mature maize SBEII including a
ribosome binding site was cloned into pEXSc-SBEI at the XbaI/NotI sites to form plasmid
pEXSc-SBEI-SBEII. The E. coli glgC gene and genes encoding maize SBEI and SBEII were also
cloned in plasmid pExs-trc and pExs-trc3 respectively and together as described for pExs2.
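
Cloning the BglII end of the glgC PCR product into a BamHI site relies on the two enzymes leaving the same cohesive end. The Python sketch below checks this from the recognition sequences. It is a simplified illustration with hypothetical names, and it assumes palindromic sites cut to leave a 5' overhang, which holds for the enzymes listed but not for enzymes in general.

```python
# Recognition site and cut position (bases from the 5' end of the top strand).
# Simplification: only palindromic sites that leave a 5' overhang are modeled.
ENZYMES = {
    "BglII": ("AGATCT", 1),
    "BamHI": ("GGATCC", 1),
    "SpeI": ("ACTAGT", 1),
    "XbaI": ("TCTAGA", 1),
}

PRIMER_G = "GAAGATCTGGCAGGGACCTGCACAC"
PRIMER_H = "GGACTAGTGCATTATCGCTCCTGTTTAT"


def overhang(enzyme):
    """Single-stranded 5' overhang left after cutting a palindromic site."""
    site, cut = ENZYMES[enzyme]
    return site[cut:len(site) - cut]


def compatible_ends(enzyme_a, enzyme_b):
    """True when both enzymes leave the same overhang, so the ends can be ligated."""
    return overhang(enzyme_a) == overhang(enzyme_b)


def has_site(sequence, enzyme):
    """True when the sequence contains the enzyme's recognition site."""
    return ENZYMES[enzyme][0] in sequence


if __name__ == "__main__":
    print("BglII/BamHI compatible:", compatible_ends("BglII", "BamHI"))
    print("primer G has BglII site:", has_site(PRIMER_G, "BglII"))
    print("primer H has SpeI site:", has_site(PRIMER_H, "SpeI"))
```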

EXAMPLE 3 

Isolation of E. coli HPG204 deficient in GBE and GS activities 

Homologous recombination was used for the strain construction. This was done
according to the method described by Hamilton et al. (Journal of Bacteriology, 1989,
171:4617-4622). A temperature-sensitive pSC101 replicon was used to facilitate the selection.
The gene coding for spectinomycin adenyltransferase was inserted at PvuII sites in plasmid
pOP12 to form plasmid HPG9, which has spectinomycin resistance and has the C-terminus of the
glgB gene and the N-terminus of the glgA gene deleted. The DNA fragment B'SA', with the
spectinomycin resistance gene inserted between partially truncated glgB and glgA, was subcloned
at the XbaI site into plasmid pMAK705, which contains a temperature sensitive replicon
(Hamilton et al., Journal of Bacteriology, 1989, 171:4617-4622), to form plasmid
pMak705B'SA'. Plasmid pMak705B'SA' was transformed into TG1 cells. After the transformed
cells were cultured in 3 mL LB with 100 µg/mL spectinomycin at room temperature overnight,
the cells were plated on LB agar plates containing 100 µg/mL spectinomycin and incubated at
44°C overnight. Single colonies were inoculated on LB agar plates containing 100 µg/mL
spectinomycin and 0.2% glucose and incubated at 44°C and at 37°C overnight. The colonies at
37°C were stained with iodine. The colony with negative staining was selected and grown in
100 mL LB at 37°C overnight. The cells were harvested and homogenized in an extraction buffer
for assaying glycogen synthase and branching enzyme activities. The cell lacking glgA and glgB
activities was named HPG204 [F' traD36 lacIq Δ(glgBXCA) Δ(lacZ)M15 proA+B+/supE
Δ(hsdM-mcrB)5 (rk- mk- McrB-) thi Δ(lac-proAB), SpectinomycinR, ChloramphenicolR]. The λDE3
lysogenization kit from Novagen was used for site specific integration of λDE3 prophage into
E. coli HPG204 to form E. coli HPG204(DE3). The lysate was prepared with P1vir and
transduced into E. coli BL21(DE3) [F' traD36 lacIq Δ(glgBXCA) Δ(lacZ)M15 proA+B+/supE
Δ(hsdM-mcrB)5 (rk- mk- McrB-) thi Δ(lac-proAB), SpectinomycinR, ChloramphenicolR].



EXAMPLE 4 
Expression of maize SS and SBE in E. coli.

Plasmids pExs-2 and pExs-trc3 have kanamycin resistance and the p15A origin of replication.
They are compatible with plasmids pET21a, pExs-trc and pTrc99A, which contain the pBR322
origin. Expression plasmids pExs-2 and pET-21a were used to express SS and SBE in E. coli
HPG204(DE3). Expression plasmids pExs-trc and pExs-trc3 were used for expression in E.
coli G6MD3. This made it possible to transform different combinations of maize SS and SBE
into E. coli HPG204(DE3) or G6MD3, which are deficient in GS and GBE activity. An overnight
culture of cells transformed with maize SS and SBE was diluted 1:20 (v/v) in fresh LB
containing 0.2% glucose, 100 µg/mL ampicillin and 50 µg/mL kanamycin. The cells were
grown at 37°C for about 2 h to A600nm = 0.6 before the expression of maize SBE and/or SS
was induced by adding isopropyl β-D-thiogalactoside to 0.5 mM. Following growth at 25°C for
4 h, the cells were harvested in a refrigerated centrifuge.
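
The two-plasmid combinations described in this example depend on pairing a p15A-origin plasmid with a pBR322-origin plasmid. The Python sketch below enumerates the pairs that satisfy that rule. It is a simplified illustration with hypothetical names: differing origins of replication are treated as the only criterion, whereas in practice distinct selection markers are also needed.

```python
from itertools import combinations

# Origin of replication for each plasmid, as stated in Example 4.
ORIGIN = {
    "pExs-2": "p15A",
    "pExs-trc3": "p15A",
    "pET-21a": "pBR322",
    "pExs-trc": "pBR322",
    "pTrc99A": "pBR322",
}


def can_coexist(plasmid_a, plasmid_b):
    """Simplified rule: two plasmids are compatible when their origins differ."""
    return ORIGIN[plasmid_a] != ORIGIN[plasmid_b]


def compatible_pairs():
    """List every pair of the plasmids above that the simplified rule allows."""
    return [pair for pair in combinations(sorted(ORIGIN), 2) if can_coexist(*pair)]


if __name__ == "__main__":
    for pair in compatible_pairs():
        print(pair)
```

The pairs used in the text, pExs-2 with pET-21a and pExs-trc with pExs-trc3, both pass this check.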



EXAMPLE 5 
Isolation of highly branched α-glucan from E. coli.

Cell pellet (30 g) was resuspended and lysed by sonication in 150 mL 50 mM Tris-acetate
buffer (pH 7.5) containing 10 mM EDTA and 5 mM DTT. After a fraction of the
homogenate was saved for assaying the STS and SBE activities, the homogenate was
centrifuged at 20,000 g for 50 min at 4°C. After collecting the supernatant, the pellet was
resuspended in 150 mL water and boiled for 15 min with occasional stirring. The resuspension
was centrifuged at 20,000 g for 30 min at room temperature. After collecting the supernatant, the
pellet was washed again with 100 mL water as above. 0.1 volumes of 50% trichloroacetic acid
(TCA) were added to the pooled fractions. After storing on ice for 30 min, the precipitate was
spun down at 15,000 g for 20 min, then washed with 30 mL 5% TCA and centrifuged as above.
The supernatant and wash were pooled and one volume of absolute ethanol was added. After
storing on ice for 30 min, the polysaccharide was collected by centrifuging at 15,000 g for 15
min. The polysaccharide was redissolved in water and precipitated with ethanol. This step was
repeated twice. The pellet was washed with methanol twice, acetone twice and dried over silica
gel at room temperature.
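
The reagent additions in this example are given in "volumes" relative to the sample. The Python sketch below converts such an addition into a final concentration. It is only an illustration: the function name is hypothetical, and volumes are assumed to be additive, which is an approximation for ethanol-water mixtures.

```python
def final_percent(stock_percent, added_volumes):
    """Final concentration after adding `added_volumes` of stock per 1 volume of sample.

    Assumes the sample contains none of the reagent and that volumes are additive.
    """
    if added_volumes < 0:
        raise ValueError("added_volumes must be non-negative")
    return stock_percent * added_volumes / (1.0 + added_volumes)


if __name__ == "__main__":
    # 0.1 volumes of 50% TCA added to the pooled fractions.
    print("TCA, %:", round(final_percent(50.0, 0.1), 2))
    # One volume of absolute ethanol added to the pooled supernatant and wash.
    print("ethanol, %:", final_percent(100.0, 1.0))
```

On this reckoning, 0.1 volumes of 50% TCA gives a final concentration of about 4.5%, close to the 5% TCA used for the subsequent wash.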



EXAMPLE 6 
Isolation of linear α-1,4 polysaccharide from E. coli

Resuspend 50 grams of cell pellet in 250 mL of 50 mM Tris acetate buffer, pH 7.5,
containing 10 mM EDTA and 5 mM DTT. Sonicate for 3 minutes (45 seconds/time, output #8,
repeat 4 times with 30 seconds interval). The homogenate is centrifuged at 12,000 rpm
(SA1500) for 50 minutes. The supernatant is checked with iodine staining and discarded.
(Save 1 mL homogenate and 1 mL supernatant for enzyme assay.) The pellet is resuspended and
extracted in 100 mL DMSO. Extract the polysaccharide by heating and stirring in a boiling water
bath for 15 min. Let it cool down to below 40 °C and centrifuge at 12,000 rpm for 30 min at
room temperature. The supernatant is pooled. The pellet is extracted two more times with 100
mL DMSO. An equal volume of absolute ethanol is added to the pooled supernatant, mixed and
stored on ice for 30 minutes. Centrifuge at 12,000 rpm for 30 min at 4 °C. The pellet is
redissolved in 20 mL DMSO by heating in a boiling water bath. 80 mL water is added and mixed
well. After adding 10 mL butanol to the solution, the solution is mixed and stored at 0 °C for
one hr (mix once in a while). Centrifuge at 12,000 rpm for 30 min at 4 °C. Repeat the step once.
The pellet is redissolved in 90 mL hot water by heating in a boiling water bath. Insoluble
materials are immediately removed by centrifugation at room temperature. Add 10 mL butanol
to the supernatant, keep at 0 °C for 1 hr and centrifuge at 12,000 rpm for 30 min at 4 °C. Repeat
the step once. The amylose precipitate is redissolved in 90 mL hot water by heating, and 10 mL
butanol are added to the solution. After storing at 0 °C on ice for one hour, it is centrifuged at
4 °C for 30 min. Repeat the step once. The pellet is redissolved in 100 mL 10% butanol by
heating. The amylose is stored at 0 °C and precipitated by centrifuging at 12,000 rpm for 30
min at 4 °C. The pellet is washed with 25 mL methanol 3 times and with acetone once. Dry
over silica gel.

EXAMPLE 7
Enzyme assays 

5 mL of supernatant were used to assay STS and SBE activities as previously described 
(Preiss) with minor modification. The reaction mixture for STS contained 100 mM Bicine 
buffer, 10 mg/mL glycogen, 0.5 mg/mL BSA, 0.5 M sodium citrate, 25 mM potassium acetate, 
10 mM GSH, 3 mM [14C]ADPGlc (500 dpm/nmol) and enzyme in a final volume of 0.1 mL. 
The reaction was carried out at 25°C for 15 min and terminated by boiling for 2 min. The 
unincorporated [14C]ADPGlc was separated on a Dowex anion exchange column (200-400 
mesh, Sigma Chemical Co.). One unit of activity is defined as 1 nmol Glc incorporated into the 
α-glucan per min at 25°C. SBE activity was determined by the phosphorylase stimulation assay. 
One unit of activity is defined as 1 mmol Glc incorporated into the α-glucan per min at 30°C. 
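
The STS unit definition above amounts to a short calculation. The following is a minimal sketch, assuming the counted dpm have already been corrected for background and that the specific radioactivity (500 dpm/nmol) and reaction time (15 min) stated in the text apply. The function names are hypothetical and are not part of the original protocol.

```python
def sts_units(dpm_incorporated, dpm_per_nmol=500.0, minutes=15.0):
    """Return STS activity in units (nmol Glc incorporated per min)."""
    if dpm_per_nmol <= 0 or minutes <= 0:
        raise ValueError("specific radioactivity and time must be positive")
    nmol_glc = dpm_incorporated / dpm_per_nmol  # dpm -> nmol glucose
    return nmol_glc / minutes                   # nmol per min


def specific_activity(units, mg_protein):
    """Return activity per mg of protein (units/mg)."""
    if mg_protein <= 0:
        raise ValueError("protein amount must be positive")
    return units / mg_protein


if __name__ == "__main__":
    # Illustrative numbers only; these are not measurements from this work.
    units = sts_units(7500.0)
    print(units, specific_activity(units, 0.02))
```

Dividing by the amount of protein in the assay gives the specific activity that is reported in the tables.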

EXAMPLE 8 
Enzyme purification 

For the recombinant SS purification, the cell pellet was resuspended in sonication buffer (50 
mM Tris-acetate, pH 7.5, 10 mM EDTA, and 5 mM DTT; 7 ml buffer per gram of cell mass), 
and cells were lysed using a Fisher 550 Sonic Dismembrator with 5 x 1 min bursts at 30 
sec intervals. The homogenate was centrifuged at 9600 g for 30 minutes. SSI in the 
supernatant was then precipitated by slowly adding neutralized saturated ammonium sulfate to 
40% saturation. After stirring on ice for an additional 50 minutes, proteins were collected by 
centrifugation at 12700 g for 45 minutes. The protein pellet was then redissolved in buffer A 
(50 mM Tris, pH 7.5, 1 mM EDTA, and 5 mM DTT) containing 0.1 M KCl and dialyzed 
against the same buffer, with one change of buffer. After dialysis, the sample was centrifuged 
at 13000 g for 20 minutes to remove insoluble materials. The resulting supernatant was loaded 
onto an amylose affinity column pre-equilibrated with dialysis buffer, and the flow-through 
was collected. The column was washed with 10 column volumes of buffer A containing 0.1 
M KCl, and then with buffer A containing 0.5 M KCl and 0.5 M maltose, collecting fractions 
during both washes. The active fractions were pooled and dialyzed overnight against buffer A, 
with one change of buffer. The next day, the amylose column sample was filtered and applied 
to a Mono Q 5/5 FPLC column (Pharmacia). After washing with buffer A, a 20 ml 0-0.4 M 
KCl gradient was employed. The active fractions were electrophoresed on an 8% SDS-PAGE 
gel (31) to determine the purity of SSI in those fractions; the fractions which were apparently 
homogeneous were pooled and concentrated using a Centricon-30 spin column (Amicon). 
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Table 1. Expression of maize starch synthases in Escherichia coli 
BL21(DE3). 

Plasmid      Maize starch       N-terminus     Protein     Specific 
             synthase gene                     (mg/mL)     activity 

pET21a       Native plasmid     -              1.8         0.009 
pEXS-3a      SSIIa-2            GENVMNVIW      2.8         - 
pEXS-3c      SSIIa              AEAEAGGKD      2.8         0.28 
pEXS-1d      SSI-3              MSIVFVTGEA     3.0         0.23 
pEXS-8       SSI-2              GDLGLEPEG      1.9         0.097 
pExs-10      SSI                CVAELSREG      1.2         0.043 
pEXS-9       SSIIb              GSVGAALRSY     1.8         0.515 
pEXS-9a      SSIIb-2            MNWWASEC       2.6         0.36 
pEXS-wx      GBSS (waxy)        ASAGMNWFV      2           0.033 
pEXS-wx2     GBSS(2)            MNWFVGAEM      2.2         0.32 

using 5 mg/mL glycogen as primer. 
Table 6. Purification Tables for SSI-1, SSI-2, and SSI-3. 



SSI-1                 volume    total mg    activity    total     purification 
                      (ml)      protein     (U/mg)      Units     (fold) 

Homogenate            630       4347        0.018       76.2      1 
Supernatant           570       -           -           -         1.1 
0-40% (NH4)2SO4       48        -           -           28.7      3.2 
Amylose column        17        -           -           11.3      279 
monoQ column          0.27      -           -           3.2       677 

SSI-2                 volume    total mg    activity    total     purification 
                      (ml)      protein     (U/mg)      Units     (fold) 

Homogenate            380       2797        0.0356      99.5      1 
Supernatant           -         -           0.0340      72.0      1 
0-40% (NH4)2SO4       -         -           -           61.8      3.7 
Amylose column        -         1.2         22.?        26.5      634 
monoQ column          -         -           17.2        5.6       483 

SSI-3                 volume    total mg    activity    total     purification 
                      (ml)      protein     (U/mg)      Units     (fold) 

Homogenate            1300      16770       0.23        3900      1 
Supernatant           1100      9790        0.31        3080      1.3 
0-40% (NH4)2SO4       237       2204        1.5         3294      6.5 
Amylose column        63        30          22.4        668       97 
monoQ column          3.6       3.1         30.5        93        132 

Notes: Assays performed during the course of purification contained 10 mg/ml glycogen and 3 mM 
[U-14C]-ADPGlc. Assays were performed at room temperature in the presence of 0.5 M citrate. 
1 Unit = 1 µmol [U-14C]-glucose transferred per min. 
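
The columns of a purification table are related by simple arithmetic: specific activity is total Units divided by total protein, fold purification is the specific activity relative to the first step, and yield is the fraction of the starting Units that remains. The sketch below, using the total protein and total Units listed for SSI-3, is illustrative only; the function name is hypothetical, and because the printed values are rounded, the recomputed figures may differ slightly from the printed columns.

```python
def purification_summary(steps):
    """steps: list of (name, total_mg_protein, total_units) in purification order.

    Returns a list of (name, specific_activity, fold, percent_yield).
    """
    _, first_mg, first_units = steps[0]
    first_sa = first_units / first_mg
    summary = []
    for name, mg, units in steps:
        sa = units / mg                              # U per mg protein
        fold = sa / first_sa                         # enrichment vs. first step
        percent_yield = 100.0 * units / first_units  # Units recovered
        summary.append((name, sa, fold, percent_yield))
    return summary


# Total protein (mg) and total Units as listed for SSI-3 in Table 6.
SSI3_STEPS = [
    ("Homogenate", 16770.0, 3900.0),
    ("Supernatant", 9790.0, 3080.0),
    ("0-40% (NH4)2SO4", 2204.0, 3294.0),
    ("Amylose column", 30.0, 668.0),
    ("monoQ column", 3.1, 93.0),
]

if __name__ == "__main__":
    for name, sa, fold, pct in purification_summary(SSI3_STEPS):
        print(f"{name:18s} {sa:8.3f} U/mg {fold:7.1f}-fold {pct:6.1f}% yield")
```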
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Table 7. Primer Kinetics for SSI enzymes 
Amylopectin 
                          SSI-3           SSI-2           SSI-1 

+ citrate     Km          240 ± 45        230 ± 50        150 ± 40 
              Vmax        26.3 ± 0.5      33.4 ± 2.1      22.5 ± 0.6 
- citrate     Km          230 ± 60        68 ± 3          120 ± 20 
              Vmax        13.2 ± 0.3      9.94 ± 0.18     7.62 ± 0.99 

Glycogen 
                          SSI-3           SSI-2           SSI-1 

+ citrate     *Vmax       43.4 ± 2.5      45.6 ± 3.3      39.0 ± 2.2 
- citrate     *Vmax       41.4 ± 2.9      45.5 ± 1.5      26.1 ± 1.4 

Assays were performed at 37°C as described in the Materials and Methods. Data are expressed 
as the average of three independent determinations along with the standard deviation. Km are 
expressed as µg/ml primer and Vmax are in µmol/min/mg protein. ADPGlc = 3 mM in all assays. 

*Because saturating glycogen concentrations could not be obtained, a standard 20 mg/ml 
glycogen was used to compare enzyme rates for that primer. 
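
The Km and Vmax values in this table can be used to estimate a rate at any primer concentration. The following is a minimal sketch that assumes simple Michaelis-Menten behavior; the source reports Km and Vmax but does not state the rate law explicitly, so that model and the function name are assumptions made here for illustration.

```python
def michaelis_menten_rate(substrate, km, vmax):
    """Rate v = Vmax * [S] / (Km + [S]); substrate and km share the same units."""
    if substrate < 0 or km <= 0:
        raise ValueError("substrate must be >= 0 and km must be > 0")
    return vmax * substrate / (km + substrate)


if __name__ == "__main__":
    # SSI-3 with amylopectin, + citrate: Km = 240 ug/ml, Vmax = 26.3 umol/min/mg.
    for s_ug_per_ml in (240.0, 1000.0, 5000.0):
        v = michaelis_menten_rate(s_ug_per_ml, km=240.0, vmax=26.3)
        print(f"[amylopectin] = {s_ug_per_ml:7.1f} ug/ml -> v = {v:.2f}")
```

At a primer concentration equal to Km the rate is half of Vmax, which is a quick consistency check on any fitted pair of values.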
Table 8. ADPGlc Kinetics for STSI enzymes. Assays and data evaluation are as in Table II. 
Km are expressed as mM ADPGlc and Vmax are in µmol/min/mg protein. 5 mg/ml amylopectin 
was used as primer for all assays. 





                          STSI-3          STSI-2          STSI-1 

+ citrate     Km          0.33 ± 0.07     0.32 ± 0.02     0.18 ± 0.02 
              Vmax        26.4 ± 1.4      32.6 ± 0.8      18.0 ± 0.5 
- citrate     Km          0.62 ± 0.04     0.25 ± 0.04     0.24 ± 0.02 
              Vmax        14.7 ± 1.3      11.7 ± 0.7      6.38 ± 0.88 
Table 9. Purification Tables for SSIIa enzymes. Assays for SSIIa-2 purification 
contained 10 mg/ml glycogen and 1.5 mM [U-14C]-ADPGlc (both are at saturating 
concentrations). Assays for SSIIa-1 purification contained 5 mg/ml amylopectin and 3 mM 
[U-14C]-ADPGlc. Assays were performed at room temperature in the presence of 0.5 M 
citrate. 1 U = 1 µmol [U-14C]-glucose transferred per min. 

SSIIa-2               volume    total mg    activity    total     purification 
                      (ml)      protein     (U/mg)      Units     (fold) 

Supernatant           300       1620        0.0216      34.3      1 
0-40% (NH4)2SO4       53        419         0.0606      25.4      2.8 
Amylose column        20        9.3         0.991       93        45.9 
monoQ column          0.9       0.94        4.81        4.5       222 

SSIIa-1               volume    total mg    activity    total     purification 
                      (ml)      protein     (U/mg)      Units     (fold) 

Supernatant           335       2613        0.28        737       1 
0-40% (NH4)2SO4       47        427         0.96        409       3.4 
Amylose column        25        11.5        8.04        92        28.7 
monoQ column          1.0       4.8         9.10        44        32.5 
Table 10. Primer Kinetics for SSIIa enzymes. Assays were performed as 
described in the Materials and Methods. Data are expressed as the average of three 
independent determinations along with the standard deviation. Km are expressed in 
µg/ml and Vmax are in µmol/min/mg protein. ADPGlc = 3 mM in all assays. *NA = not 
applicable; enzyme cannot be saturated by primer under these conditions. 



Amylopectin 
                                  SSIIa-2          SSIIa-1 
+ citrate 
    27°C          Km              153 ± 22         182 ± 38 
                  Vmax            7.82 ± 0.63      24.1 ± 0.5 
    37°C          Km              133 ± 18         153 ± 64 
                  Vmax            15.4 ± 0.6       41.1 ± 0.2 
- citrate 
    27°C          Km              234 ± 30         404 ± 33 
                  Vmax            4.31 ± 0.32      10.5 ± 0.3 
    37°C          Km              1350 ± 220       NA* 
                  Vmax            7.84 ± 0.25      NA* 

Glycogen 
                                  SSIIa-2          SSIIa-1 
+ citrate 
    27°C          Km              50.7 ± 3.8       162 ± 17 
                  Vmax            5.53 ± 0.44      14.2 ± 0.7 
    37°C          Km              76.9 ± 7.8       350 ± 11 
                  Vmax            11.3 ± 0.7       31.6 ± 0.8 
Table 11. ADPGlc Kinetics for SSIIa enzymes. Assays and data evaluations are as in 
Table II. The primer in each assay was saturating for each enzyme and was 
determined by the experiments detailed in Table II. Km are expressed as mM ADPGlc and 
Vmax are in µmol/min/mg protein. *NA = not applicable, as the enzyme cannot be saturated by 
primer under these conditions. 







with amylopectin as primer 
                                  SSIIa-2          SSIIa-1 
+ citrate 
    27°C          Km              0.17 ± 0.04      0.48 ± 0.09 
                  Vmax            4.83 ± 0.42      23.0 ± 2.5 
    37°C          Km              0.28 ± 0.01      0.83 ± 0.08 
                  Vmax            11.4 ± 0.6       49.1 ± 2.6 
- citrate 
    27°C          Km              0.27 ± 0.02      0.46 ± 0.06 
                  Vmax            4.87 ± 0.25      12.1 ± 0.8 
    37°C          Km              0.28 ± 0.005     NA* 
                  Vmax            7.86 ± 0.53      NA* 

with glycogen as primer 
                                  SSIIa-2          SSIIa-1 
+ citrate 
    27°C          Km              0.16 ± 0.03      0.19 ± 0.02 
                  Vmax            4.41 ± 0.21      17.1 ± 0.7 
    37°C          Km              0.15 ± 0.03      0.37 ± 0.04 
                  Vmax            7.60 ± 0.94      40.1 ± 1.7 






Table 12. Purification Tables for SSIIb-2 and SSIIb-1. Assays performed during the 
course of purification contained 10 mg/ml glycogen and 3 mM [U-14C]ADPGlc. Assays were 
performed at room temperature in the presence of 0.5 M citrate. 1 U = 1 µmol 
[U-14C]glucose transferred per min. 

SSIIb-2               volume    total mg    activity    total     purification 
                      (ml)      protein     (U/mg)      Units     (fold) 

Supernatant           890       9256        0.48        4450      1 
0-40% (NH4)2SO4       190       2660        1.24        3306      2.6 
Amylose column        13        31.2        50.6        1573      105 
monoQ column          6.6       16.3        56.8        939       118 

SSIIb-1               volume    total mg    activity    total     purification 
                      (ml)      protein     (U/mg)      Units     (fold) 

Supernatant           365       2336        0.64        1533      1 
0-40% (NH4)2SO4       56        436         2.35        1030      3.7 
Amylose column        80        10.4        50.2        521       78 
monoQ column          0.6       0.28        60.6        17.6      94 
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Table 13. Kinetics for SSIIb enzymes. Assays were performed at 37°C as 
described in the Materials and Methods. Data are expressed as the average of 
three independent determinations along with the standard deviation. For ADPGlc 
kinetics, Km are expressed in mM ADPGlc. For primer kinetics, Km are 
expressed as mg/ml primer, and 3 mM ADPGlc were used in the assays. Vmax 
are in µmol min-1 mg-1 protein. 

ADPGlc Kinetics 
                                  SSIIb-2          SSIIb-1 
with glycogen 
                  Km              0.32 ± 0.04      0.71 ± 0.01 
                  Vmax            130 ± 6          76.8 ± 3.2 
with amylopectin 
                  Km              0.32 ± 0.03      0.40 ± 0.02 
                  Vmax            90.9 ± 4.2       72.8 ± 2.8 

Primer Kinetics 
                                  SSIIb-2          SSIIb-1 
glycogen 
                  Km              0.36 ± 0.02      0.43 ± 0.02 
                  Vmax            120 ± 3          79.5 ± 3.3 
amylopectin 
                  Km              0.26 ± 0.04      0.074 ± 0.008 
                  Vmax            84.5 ± 2.4       67.9 ± 1.7 
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Table 14. Comparison of kinetic data for expressed SS's. Data for SSI and SSIIa are 
from Imparl-Radosevich et al., 1998; Imparl-Radosevich JM, Li P, McKean AL, Keeling PL, 
and Guan HP, submitted for publication. Km for amylopectin and glycogen are expressed in 
mg/ml; Km for ADPGlc are in mM and were determined in the presence of amylopectin and 0.5 
M citrate. Vmax are in µmol min-1 mg-1. The Km for glycogen for SSI could not be determined 
as saturating concentrations of glycogen could not be reached for this enzyme. 

Kinetic parameter          SSI-3*   SSI-1    SSIIa-2*  SSIIa-1  SSIIb-2*  SSIIb-1 

Km for amylopectin         0.24     0.15     0.13      0.15     0.26      0.07 
Km for glycogen            -        -        0.077     0.35     0.36      0.43 
Km for ADPGlc              0.33     0.18     0.28      0.83     0.32      0.40 
Vmax (with amylopectin)    26.3     22.5     15.4      41.1     84.5      67.9 
Vmax (with glycogen)       43.4     39.0     11.3      31.6     120       79.5 

*denotes N-terminally truncated form of SS, while any SS with 
the designation SS-1 is the full length version of the SS. 
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Table 15. The starch synthase activities of the chimerical enzymes 
Generation of chimerical enzymes of maize starch synthase: the 
recombination of N-terminal extensions with C-terminal catalytic domains of 
starch synthase 

The genes coding for the N-terminal extensions of STSI, STSIIa and STSIIb 
were inserted, in the same (+) or reverse (-) orientation of the original N-terminal 
sequence, in front of the C-terminal catalytic domains of WX2, STSIIa 
and STSIIb, respectively. The chimerical enzymes were expressed in E. coli 
and the activities were assayed. 

                      WX2                   STSIIa                STSIIb 
                      C-catalytic domain    C-catalytic domain    C-catalytic domain 

STSI                  N1-WX2                N1-C2                 N1-C3 
N-extension           (+)       (-)         (+)       (-)         (+)       (-) 
                      NRA       9.0         6.6       39.7        89.2      NRA 

STSIIa                N2-WX2                N2-C2                 N2-C3 
N-extension           (+)       (-)         (+)       (-)         (+)       (-) 
                      9.2       11.2        213.8     8.7         232.5     NRA 

STSIIb                N3-WX2                N3-C2                 N3-C3 
N-extension           (+)       (-)         (+)       (-)         (+)       (-) 
                      NRA       NRA         11.2      NRA         400.5     12.0 

N1: STSI N-terminal extension; 
N2: STSIIa N-terminal extension; 
N3: STSIIb N-terminal extension; 
WX2: WX2 C-terminal catalytic domain; 
C2: STSIIa C-terminal catalytic domain; 
C3: STSIIb C-terminal catalytic domain. 

(+): the N-terminal extensions were inserted in front of the C-terminal catalytic domains in same 
orientation. 

(-): the N-terminal extensions were inserted in front of the C-terminal catalytic domains in reverse 
orientation. 

* Starch synthase enzyme activity: nmol/min mg protein. 

* The residual glycogen synthase activity of BL21(DE3) is 2.6 nmol/min mg protein. 

* NRA: No recombinant available. 
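
One way to read the table is to compare each chimera against the residual glycogen synthase activity of the BL21(DE3) host. The sketch below subtracts that background from the measured values; the source reports the background but does not state that it was subtracted, so the subtraction, the data layout, and the function names are assumptions made here for illustration only.

```python
BACKGROUND = 2.6  # nmol/min/mg protein, residual activity of BL21(DE3)

# Activities from Table 15 (nmol/min/mg protein); None marks NRA entries.
ACTIVITY = {
    "N1-WX2": {"+": None, "-": 9.0},
    "N1-C2": {"+": 6.6, "-": 39.7},
    "N1-C3": {"+": 89.2, "-": None},
    "N2-WX2": {"+": 9.2, "-": 11.2},
    "N2-C2": {"+": 213.8, "-": 8.7},
    "N2-C3": {"+": 232.5, "-": None},
    "N3-WX2": {"+": None, "-": None},
    "N3-C2": {"+": 11.2, "-": None},
    "N3-C3": {"+": 400.5, "-": 12.0},
}


def net_activity(measured, background=BACKGROUND):
    """Activity above host background, or None when no recombinant exists."""
    return None if measured is None else measured - background


def most_active(table=ACTIVITY):
    """Return (chimera, orientation, activity) for the highest measured value."""
    measured = [
        (name, orientation, value)
        for name, by_orientation in table.items()
        for orientation, value in by_orientation.items()
        if value is not None
    ]
    return max(measured, key=lambda entry: entry[2])


if __name__ == "__main__":
    print(most_active())
    print(net_activity(ACTIVITY["N2-C2"]["+"]))
```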
The photographs listed in Figures 42 and 21 attempt to show the visual differences 
that are present in the starches as compared to those known in the art. 

Description of the starch 

Corn starch is a milky, slightly thickened gel which is slightly, if at all, flowable. 

Rice starch forms two levels. The upper level has a thickened, syrup-like consistency, more 
flowable than corn starch (less thick than corn starch), with an opaque milky color (more 
translucent than corn starch in this level). The lower level is a very white glob that transmits 
little light. This lower level is formed in a very thick mass and does not appear flowable. 

Corn amylopectin is slightly less white than the top level of rice starch and is a very 
slightly opaque milky color (more translucent than corn starch), slightly less flowable than the 
rice top level. 

Potato dextrin is the most transparent, almost appearing clear, but is still opaque white, 
and it is very flowable, appearing only slightly less flowable than water. 

Waxy maize starch will flow very slowly and has the consistency of honey. The color 
is very opaque, transmitting little light, and is only slightly less light than corn starch. 

SSI starch made from plasmid pExs-8 has two distinct levels. The top level appears 
clear and slightly thicker than water. The bottom level appears as a precipitate. This sample 
resembles the ornaments that contain little figures and plastic flakes resembling snowflakes. 
Like those ornaments, when turned upside down the sample appears to be falling snow. 
However, the flakes in this sample appear to be slightly gummy and, in the first moments of 
level mixing, appear to form an opaque white liquid. 

SSI starch made from a host containing the two plasmids pExsC BEI BEII 
and pExs8 is not as clear as the top level of pExs-8 and appears slightly less thick than pExs-8. 
It has even more flowability than does potato dextrin. 

SSIIb starch made from a host containing the two plasmids pExsC BEI BEII 
and pExs-9 is not as clear as the top level of pExs-8 and appears slightly less thick than pExs-8. 
It has even more flowability than does potato dextrin. 

WAXY starch made from a host containing the two plasmids pExsC BEI 
BEII and pExs-wx is not as clear as the top level of pExs-8 but seems to have a few tiny 
thread-like chains that settle to the bottom and, when mixed, give the material a slightly 
whiter color. It appears slightly less thick than pExs-8 and has even more flowability than 
does potato dextrin. 

SSII starch made from a host containing the two plasmids pExsC BEI BEII 
and pExs-3a is the color of corn starch, maybe slightly whiter, but not as white as the 
bottom level of pExs8. It definitely transmits more light and has the flowability 
characteristic of pExs-8 when mixed. 

glgA starch appears to have a very slight precipitate and is comparable in color to corn 
amylopectin and to pExsC BEI BEII and pExs-wx. Its flowability is between that of corn 
amylose and pExsC BEI BEII pExs-wx. 

The samples of polysaccharides listed above form groups generally according to color, 
as follows. Waxy maize starch, corn starch, pExsC BEI BEII pExs3a and pExsc8 are 
the whitest group. The flowability characteristics of this group are fairly diverse, with corn 
starch a lump, waxy maize starch only slightly flowable, and pExsC BEI BEII and pExs-3a 
and pExsC-8 more like water than syrup. The second group contains corn amylopectin, 
pExsC BEI BEII pExs-wx, and pExsC BEI BEII and pExs-A1, which are less white and clearer. 
The flowability of corn amylopectin is less than that of the other two members of this group 
but is still similar. The last group is the least white and thus the clearest. This group includes 
pExsC BEI BEII and pExs-8, potato dextrin, pExsC BEI BEII and pExs-10, and pExsC BEI 
BEII and pExs-9. The flowability within this group is also similar. 
Plant Hosts 

The following plasmids have been transformed into rice plants. The sequence for the 
mutant glgC gene is shown in Figure 46. The plasmids are made in substantially the same 
manner as described above for the production of the bacterial plasmids. The plasmid maps 
shown in Figures 25-41, this application, and the listed short protocols allow the ordinarily 
skilled person in the art to make the present plasmids. The following combinations of plasmids 
have been transformed into rice plants. Additionally, combinations of plasmids are being 
developed, including the combination that includes all of the maize genes SSI, SSII, SSII, BEI, 
BEII, and GBSS in one host, or alternatively in two hosts that are then crossed to form a hybrid 
having the entire complement of up regulated starch genes. The ordinarily skilled person in 
the art could also have placed the sequences in the antisense orientation to down regulate 
these genes, to the extent that maize genes will down regulate the partially homologous rice 
genes. 

The first group of transgenics, group 1, includes rice transformants (transformed by 
microprojectile bombardment) containing MSTSI-2 (pExs52) and glgC3 (pExs66), and 
MSTSIIa-2 and glgC3 (pExs53 and pExs56). The second group of rice transformants contains 
MSTSIIa and glgC3 (pExs54 and pExs56). The third group of transformants contains 
transgenic 5, MSTSIIb and glgC3 (pExs61 and pExs66), and transgenic 6, maize wx and glgC3 
(pExs65 and pExs66). Additionally, glgA, glgB and glgC are combined and transformed into 
rice. This last transformant combines the rice plant's starch pathway with the gene encoding 
ADPG pyrophosphorylase and the bacterial genes. The combination of plasmids encoding 
at least one of the following enzymes, SSI, SSIIa, SSIIb, SSIII, debranching enzymes, BEI, 
BEII, GBSS (wx), with some or all of the bacterial starch genes is also useful. 

There are presently over 300 transformants in the greenhouse. The T1 transgenic rice plants 
have been screened and characterized (Figures 56, 57). Twelve plants have successfully 
expressed maize SSI-2 in rice seeds. Twenty-one plants have successfully expressed maize 
SSIIb in rice seeds. We are currently screening rice plants in which rice SS expression is 
down regulated by cosuppression and have 400 T2 plants in the greenhouse. 
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Maize Starch Synthase and its Mutant Forms. 

In order to characterize the multiple forms of maize starch synthase, the genes coding 
for the full length SS and its N-terminally truncated forms were expressed in E. coli. The 
recombinant enzymes were purified and kinetically characterized. We have demonstrated that 
the different isoforms and their truncated forms all have distinct properties (Tables 6-14, 
Figures 58-63). The specific activities (Vmax) of the purified maize SSI-1, SSI-2, and SSI-3 
were 22.5, 33.4 and 26.3 µmol glc/min/mg of protein, respectively. Our results clearly indicate 
that the catalytic center of SSI is not located in its N-terminal extension. However, N-terminal 
truncation decreased the enzyme's affinity for amylopectin, with the Km for amylopectin of the 
truncated SSI-3 being about 60%-90% higher than that of the full length SSI-1. The effects of 
N-terminal truncation of SSIIa depend upon the assay conditions used. For both SSIIa-1 and 
SSIIa-2, the Vmax of each enzyme increased 2-fold upon raising the assay temperature from 
27°C to 37°C (Tables II and III). However, the effect of temperature on ADPGlc affinity was 
different for SSIIa-1 and SSIIa-2. For the truncated SSIIa-2, the Km for ADPGlc was not 
affected by raising the temperature. In contrast, the Km for ADPGlc of the putative full length 
SSIIa-1 increased 2-fold upon raising the assay temperature from 27°C to 37°C (Table III). 
Interestingly, the truncated SSIIa-2 exhibited a lower Km for ADPGlc than SSIIa-1 did in all 
assay conditions used in this study, except that they showed similar Km values for ADPGlc 
when glycogen was used as a primer at 27°C. Although N-terminal truncation of SSIIa 
appears to lower the Km for ADPGlc under most assay conditions, it must also be noted that the 
maximal velocity of the truncated SSIIa-2 is decreased by about 2-4 fold when compared to 
SSIIa-1. The truncated SSIIb-2 was found to be more temperature stable than the longer 
SSIIb-1 in the presence of citrate, while little difference was observed in their pH activity 
profiles. While the putative full length SSIIb-1 showed a similar Vmax using amylopectin or 
glycogen as a primer, the N-terminally truncated SSIIb-2 showed a 40% increase in Vmax using 
glycogen compared with amylopectin as a primer. N-terminal truncation of SSIIb increased its 
Vmax by 25% with amylopectin as a primer. We also demonstrated that chimeric enzymes of 
maize starch synthase (combining the C-terminal domain of SS with different N-terminal 
sequences of SS or unrelated sequences) would produce a functional enzyme with SS activity 
and altered properties (Table 15). 
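
The percentages quoted in this section can be checked against the tabulated Km and Vmax values. The short sketch below is illustrative only: the helper name is hypothetical, and the pairing of each quoted percentage with specific entries of Tables 7 and 13 is an interpretation made here rather than something the text states.

```python
def percent_change(value, reference):
    """Percent difference of value relative to reference."""
    if reference == 0:
        raise ValueError("reference must be non-zero")
    return 100.0 * (value - reference) / reference


# Km for amylopectin (ug/ml), truncated SSI-3 vs. full length SSI-1 (Table 7).
km_with_citrate = percent_change(240.0, 150.0)
km_without_citrate = percent_change(230.0, 120.0)

# Vmax (umol/min/mg) from the primer kinetics in Table 13.
ssiib2_glycogen_vs_amylopectin = percent_change(120.0, 84.5)
ssiib_truncation_with_amylopectin = percent_change(84.5, 67.9)

if __name__ == "__main__":
    print(f"SSI-3 vs SSI-1 Km, + citrate: {km_with_citrate:.0f}% higher")
    print(f"SSI-3 vs SSI-1 Km, - citrate: {km_without_citrate:.0f}% higher")
    print(f"SSIIb-2 Vmax, glycogen vs amylopectin: {ssiib2_glycogen_vs_amylopectin:.0f}%")
    print(f"SSIIb-2 vs SSIIb-1 Vmax, amylopectin: {ssiib_truncation_with_amylopectin:.0f}%")
```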
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Conclusions, Ramifications, and Scope 

Accordingly, it can be seen that, according to the invention, the starch genes can 
produce new and altered starch in either host, plant or bacteria. Additionally, polysaccharides 
very similar to corn starch can be produced in a bacterial host. 

Although the description above contains many specificities, these should not be 
construed as limiting the scope of the invention but as merely providing illustrations of some 
of the presently preferred embodiments of this invention. Various other embodiments and 
ramifications are possible within its scope. For example, different combinations of the 
plasmids can be used in either host for the production of useful plants, useful grain and useful 
polysaccharides. 

Thus the scope of the invention should be determined by the appended claims and their 
legal equivalents, rather than by the examples given. All references cited herein are 
incorporated herein in their entirety by reference. 
1. A method of producing polysaccharides which are non glycogen like in a host 
comprising: 

a. transforming a host capable of being used in a fermentation process, with genes 
selected from the group which produce starch synthesizing enzymes, glycogen synthesizing 
enzymes such that the host produces nonglycogen like starch, and 

b. employing the host in a fermentation process wherein the fermentation process 
produces polysaccharides. 

2. A method according to claim 1 wherein the host is bacteria. 

3. A host transformed to carry a gene active in glycogen production, and at least one 
nonstarch branching gene active in the production of at least one of the following 
polysaccharides, amylopectin and amylose, in its original host. 

4. A host according to claim 3 wherein the host is a monocot. 

5. A host according to claim 3 wherein the host is a dicot. 

6. A host according to claim 3 wherein the host is a plant. 

7. A host according to claim 3 wherein the host is a bacteria. 

8. A host according to claim 3 wherein the host is a cereal bearing plant. 

9. A host according to claim 3 wherein the bacterial gene is selected from the group 
consisting of glgC gene, glgA gene, glgB gene. 

10. A host according to claim 3 wherein at least one nonstarch branching gene active 
in the production of at least one of the following polysaccharides, amylopectin and amylose, in 
its original plant is selected from the group consisting of starch soluble starch synthase I, II, III 
genes and debranching enzyme gene (su1), GBSS gene, sh2 gene and bt2 gene. 

11. A host according to claim 3 including at least one of the starch branching enzyme 
genes. 

12. A host according to claim 11 including the starch branching enzyme gene BEI 
gene. 

13. A host according to claim 11 including the starch branching enzyme gene BEII 
gene. 

14. A host according to claim 12 including the starch branching enzyme gene BEII 
gene. 

15. A host transformed to carry a gene active in ADPG production, and at least one 
starch gene active in the production of at least one of the following polysaccharides, 
amylopectin and amylose, in its original host, wherein the host produces polysaccharides that 
are plant like starch and not glycogen like. 

16. A host transformed to carry a pyrophosphatase gene, glycogen synthase gene, 

17. A host according to claim 1 wherein the gene active in glycogen production is a 
bacterial gene. 

18. A host deficient in alpha 1,4 glucan synthesizing ability and alpha 1,4-1,6 
branching enzyme capability transformed to express at least one plant starch soluble 
synthesis gene. 

19. A host according to claim 18 including being transformed to express at least one 
gene encoding for debranching enzyme. 

20. A host according to claim 18 wherein said gene is encoding for starch soluble 
synthase enzyme I. 

21. A host according to claim 18 wherein said gene is encoding for starch soluble 
synthase enzyme II. 

22. A host according to claim 18 wherein said gene is encoding for starch soluble 
synthase enzyme III. 

23. A host according to claim 18 including being transformed to express at least one 
gene encoding for starch branching enzyme. 

24. A host according to claim 23 wherein the starch branching enzyme is BEI. 

25. A host according to claim 23 wherein the starch branching enzyme is BEII. 

26. A plasmid wherein said plasmid is in a carrier host and said plasmid contains the 
SSII gene with the N-terminus GENVMNVIVV wherein the gene is approximately 1561 base 
pairs in length. 

27. A plasmid according to claim one wherein said host is a bacterial host. 

28. A host according to claim two wherein the host is a wild type E. coli. 

29. A new polysaccharide produced by a transformed host comprising: 
said host having a wildtype, said wildtype of said host does not produce said new 
polysaccharide, said transformed host expressing at least two exogenous starch synthesis 
genes, said genes are selected from a group consisting of soluble starch synthesis genes such as 
SSI SSII SSIII, wherein the transformed host is capable of producing such new polysaccharide. 
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30. The new polysaccharide of claim 29 wherein said host also expresses the 
exogenous genes selected from the following group consisting of bacterial glycogen inducing 
genes. 

31. The new polysaccharide of claim 29 wherein said host also expresses the 
exogenous genes selected from the following group consisting of plant granule bound 
enzymes. 

32. A method according to claim one wherein the host is fungal. 

33. A method according to claim 32 wherein the host is yeast. 

34. A method according to claim one wherein said glycogen synthesizing genes 
include glgC, glgA, glgB genes. 

35. A method according to claim one wherein said genes which produce starch 
synthesizing enzymes include genes encoding for starch soluble synthases I, II, III. 

36. A method according to claim one wherein said genes which produce starch 
synthesizing enzymes include genes encoding for and debranching enzyme and branching 
enzymes. 

37. The new polysaccharide of claim 30 wherein said bacterial glycogen inducing 
genes are selected from the group consisting of glgC, glgA, glgB. 

38. The new polysaccharide of claim 29 wherein said starch synthesis genes 
are selected from the group consisting of BEI and BEII. 
Date: 




Notes: 

pEXS3c is the 1082 bp Nde I-EcoR I fragment containing the N-terminus of MSSSIIc 
(from MSSSIIc in pBSK) subcloned into the Nde I-EcoR I sites of pEXS3a, replacing the 
N-terminus of IIa with the longer IIc N-terminus. MSSSIIc is the mature maize STS2 and is 
2090 bp long. The following sites are NOT contained in the MSSSIIc insert: Apa I, Bgl II, 
EcoR V, Not I, Spe I, and Xba I. 

See Ping's Notebook #1 pp. 149, 159, 160. 

vector pET-21a 

N-terminus of MSSSIIc: AEAEAGGKD 
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Date: 7/31/95 P*I2875 
Notes: 

PEXSA1 is a 1551 bp Spe l-Sac I fragment 
containing gig A (from glgA tn pBSK) 
feubcloned Into the Xba l-Sac i sites of 
pET-23d. 

see "g!g A, B.C" notebook p. 71 . 
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pEXS-trc is pTrc99A-Ndel which has been 
mutagenized. (Nco I srte in multiple cloning 
site of pTrc99A-Ndel is mutagenized to Nde I 
using primers EXS63 and EXS64.) pEXS-trc 
contains only one Nde I site and no Nco I 
sites. The following sites are NOT contained 
in pEXS-trc: Bgl II, Cla I, Nco I, Not I, Sac II, 
SnaB I. Spe I, and Xho I. 
see Angela's Notebook #1 pp. 231, 242-244, 
246, 248-250. 
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Notes: 

pEXS-trc3 is pEXS-trcl cut with Bgl I (filled 
in)-Sca I and religated, deleting most of the 
Amp gene (304 nt from the 5* end remain). 
The following sites are NOT contained in 
pEXS-trc3: Apa I, Bgl II, EcoR V, Nco I. Not I, 
SnaB I, and Spe I. 

see Ping's Notebook #2 pp. 204. 209. 
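
The "sites NOT contained" statements in the notes above can be checked mechanically once a construct sequence is available. The following is a minimal sketch, not the procedure used by the inventors. The recognition sequences are the standard ones for the named enzymes; the function names and the `toy_insert` sequence are hypothetical and serve only as illustration (the toy sequence is one possible coding of AEAEAGGKD flanked by arbitrary sites, not the MSSSIIc insert).

```python
# Recognition sequences (5'->3') for the enzymes named in the notes.
# Every site listed here is palindromic, so scanning one strand is enough.
RECOGNITION_SITES = {
    "Apa I": "GGGCCC",
    "Bgl II": "AGATCT",
    "Cla I": "ATCGAT",
    "EcoR I": "GAATTC",
    "EcoR V": "GATATC",
    "Nco I": "CCATGG",
    "Nde I": "CATATG",
    "Not I": "GCGGCCGC",
    "Sac I": "GAGCTC",
    "Sac II": "CCGCGG",
    "SnaB I": "TACGTA",
    "Spe I": "ACTAGT",
    "Xba I": "TCTAGA",
    "Xho I": "CTCGAG",
}


def count_sites(sequence, enzyme):
    """Count occurrences (overlaps included) of an enzyme's site."""
    seq = "".join(sequence.upper().split())
    site = RECOGNITION_SITES[enzyme]
    return sum(
        1
        for i in range(len(seq) - len(site) + 1)
        if seq[i:i + len(site)] == site
    )


def absent_sites(sequence, enzymes):
    """Return the enzymes from `enzymes` whose site does not occur."""
    return [e for e in enzymes if count_sites(sequence, e) == 0]


# Hypothetical toy sequence, for illustration only.
toy_insert = "GGATCCCATATGGCTGAAGCTGAAGCTGGTGGTAAAGACGAATTCAAGCTT"

if __name__ == "__main__":
    claimed_absent = ["Apa I", "Bgl II", "EcoR V", "Not I", "Spe I", "Xba I"]
    print(absent_sites(toy_insert, claimed_absent))
    print(count_sites(toy_insert, "Nde I"), count_sites(toy_insert, "EcoR I"))
```

A linear scan like this ignores circular topology; for a complete plasmid, a site spanning the arbitrary start of the sequence file would need the sequence to be wrapped before scanning.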
Fig. 43

DNASIS ***** DNA TRANSLATION LIST ***** DATE 04-03-97

*** INPUT INFORMATION ***

FILE NAME : ECGLGA1.SEQ SEQUENCE : NORMAL 1488 BP
CODON TABLE : UNIV.TCN

SEQUENCE REGION : 1 - 1488

TRANSLATION REGION : 1 - 1488

*** DNA TRANSLATION ***

1 ATG CAG GTT TTA CAT GTA TGT TCA GAG ATG TTC CCG CTG CTT AAA ACC 48
1 MQVLHVCSEMFPLLKT 16

49 GGC GGT CTG GCT GAT GTT ATT GGG GCA TTA CCC GCA GCA CAA ATC GCA 96
17 GGLADVIGALPAAQIA 32

97 GAC GGC GTT GAC GCT CGC GTA CTG TTG CCT GCA TTT CCC GAC ATT CGC 144
33 DGVDARVLLPAFPDIR 48

145 CGT GGC GTG ACC GAT GCG CAG GTA GTA TCC CGT CGT GAT ACC TCC GCC 192
49 RGVTDAQVVSRRDTSA 64

193 GGA CAT ATC ACG CTG TTG TTC GGT CAT TAC AAC GGG GTT GGC ATT TAC 240
65 GHITLLFGHYNGVGIY 80

241 CTG ATT GAC GCG CCG CAT CTC TAT GAT CGT CCG GGA AGT CCG TAT CAC 288
81 LIDAPHLYDRPGSPYH 96

289 GAT ACC AAC TTA TTT GTC CAT ACC GAC AAC GTA TTG CGT TTT GCG CTG 336
97 DTNLFVHTDNVLRFAL 112

337 CTG GGG TGG GTT GGG GCA GAA ATG GCC AGC GGG CTT GAC CCA TTC TGG 384
113 LGWVGAEMASGLDPFW 128

385 CGT CCT GAT GTG GTG CAT GCG CAC GAC TGG CAT GCA GGC CTT GCG CCT 432
129 RPDVVHAHDWHAGLAP 144

433 GCG TAT CTG GCG GCG CGC GGG CGT CCG GCG AAG TCG GTG TTT ACT GTG 480
145 AYLAARGRPAKSVFTV 160

481 CAC AAC CTA GCC TAT CAA GGC ATG TTT TAT GCA CAT CAC ATG AAT GAC 528
161 HNLAYQGMFYAHHMND 176

529 ATC CAA TTG CCA TGG TCA TTC TTT AAT ATT CAT GGG CTG GAA TTC AAC 576
177 IQLPWSFFNIHGLEFN 192

577 GGA CAA ATC TCT TTC CTG AAG GCC GGT CTG TAC TAT GCC GAT CAC ATT 624
193 GQISFLKAGLYYADHI 208

625 ACG GCG GTC AGT CCA ACC TAC GCT CGC GAG ATC ACC GAA CCG CAG TTT 672
209 TAVSPTYAREITEPQF 224

Fig. 43 (continued)

673 GCC TAC GGT ATG GAA GGT CTG TTG CAA CAG CGT CAC CGC GAA GGG CGT 720
225 AYGMEGLLQQRHREGR 240

721 CTT TCC GGC GTA CCG AAC GGC GTG GAC GAG AAA ATC TGG AGT CCA GAG 768
241 LSGVPNGVDEKIWSPE 256

769 ACG GAC TTA CTG TTG GCC TCG CGT TAC ACC CGC GAT ACG TTG GAA GAT 816
257 TDLLLASRYTRDTLED 272

817 AAA GCG GAA AAT AAG CGC CAG TCA CAA ATC GCA ATG GGA TCC AAG GTT 864
273 KAENKRQSQIAMGSKV 288

865 GAC GAT AAA GTG CCG CTT TTT GCA GTG GTG AGC CGT CTG ACC AGC CAG 912
289 DDKVPLFAVVSRLTSQ 304

913 AAA GGT CTC GAT TCG GTG CTG GAA GCC TCA CCG GGT TCT TCG GAG CAG 960
305 KGLDSVLEASPGSSEQ 320

961 GGC GGG CAG CTG GCG CTA CTC GGC GCG GGC GAT CCG GTG CTG CAG GAA 1008
321 GGQLALLGAGDPVLQE 336

1009 GGT TTC CTT GCG GCG GCA GCG GAA TAC CCC GGT CAG GTG GGC GTT CAG 1056
337 GFLAAAAEYPGQVGVQ 352

1057 ATT GGC TAT CAC GAA GCA TTT TCG CAT CGC ATT ATG GGC GGC GCG GAC 1104
353 IGYHEAFSHRIMGGAD 368

1105 GTC ATT CTG GTG CCC AGC CGT TTC GAA CCG TGC GGC TTA ACG CAA CTT 1152
369 VILVPSRFEPCGLTQL 384

1153 TAT GGA TCG AAG TAC GGT ACG CTG CCG TTA GTG CGA CGC ACC GGT GGG 1200
385 YGSKYGTLPLVRRTGG 400

1201 CTT GCT GAT ACG GTT TCT GAC TGT TCT CTC GAG AAC CTT GCA GAT GGC 1248
401 LADTVSDCSLENLADG 416

1249 GTC GCC AAT GGG TTT ATC TTC GAA GAT AGT AAT GCC TGG TCG CTG TTA 1296
417 VANGFIFEDSNAWSLL 432

1297 CGG ACT ATT CGA CGT GCT TTT GTA CTG TGG TCC TGT CCT CCA CTG TGG 1344
433 RTIRRAFVLWSCPPLW 448

1345 CGG TTT GTG CAA CGT CAG GCT ATG GCA ATG GAT TTT GGC TGG CAG GTC 1392
449 RFVQRQAMAMDFGWQV 464

1393 GCG GCG AAG TCG TAC CGT GAG CTT TAC TAT CGC TCG AAA TAG TTT TCA 1440
465 AAKSYRELYYRSK*FS 480

1441 GGA AAC GCC TAC ATG AAT GCT CCG TTT ACA TAT TCA TCG CCC ACG CTT 1488
481 GNAYMNAPFTYSSPTL 496
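
The listing above pairs each codon with its one-letter amino acid under the universal codon table (UNIV.TCN) and numbers rows of 16 codons. The following is a minimal sketch of how such a listing can be regenerated and cross-checked. It is not the DNASIS program itself; the function names `translate` and `listing_rows` are hypothetical. The only data used is the first row of the ECGLGA1.SEQ listing.

```python
# Standard (universal) genetic code, bases ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate DNA in frame 1; '*' marks a stop; a trailing partial codon is ignored."""
    seq = "".join(dna.upper().split())
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


def listing_rows(dna, codons_per_row=16):
    """Return (nucleotide_row, amino_acid_row) string pairs, numbered from 1."""
    seq = "".join(dna.upper().split())
    usable = len(seq) - len(seq) % 3
    codons = [seq[i:i + 3] for i in range(0, usable, 3)]
    rows = []
    for start in range(0, len(codons), codons_per_row):
        chunk = codons[start:start + codons_per_row]
        first_nt = 3 * start + 1
        last_nt = 3 * (start + len(chunk))
        residues = "".join(CODON_TABLE[c] for c in chunk)
        rows.append((
            f"{first_nt} {' '.join(chunk)} {last_nt}",
            f"{start + 1} {residues} {start + len(chunk)}",
        ))
    return rows


# First row of the ECGLGA1.SEQ listing.
FIRST_ROW = "ATG CAG GTT TTA CAT GTA TGT TCA GAG ATG TTC CCG CTG CTT AAA ACC"

if __name__ == "__main__":
    for nt_row, aa_row in listing_rows(FIRST_ROW):
        print(nt_row)
        print(aa_row)
```

Because the amino-acid row is fully determined by the codon row, a check of this kind is also a convenient way to catch transcription errors in a printed listing.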
DNASIS ***** DNA TRANSLATION LIST ***** DATE 04-03-97

*** INPUT INFORMATION ***

FILE NAME : ECGLGB.SEQ SEQUENCE : NORMAL 2361 BP

CODON TABLE : UNIV.TCN

SEQUENCE REGION : 1 - 2361

TRANSLATION REGION : 1 - 2361

*** DNA TRANSLATION ***

1 ATG TCC GAT CGT ATC GAT AGA GAC GTG ATT AAC GCG CTA ATT GCA GGC 48
1 MSDRIDRDVINALIAG 16

49 CAT TTT GCG GAT CCT TTT TCC GTA CTG GGA ATG CAT AAA ACC ACC GCG 96
17 HFADPFSVLGMHKTTA 32

97 GGA CTG GAA GTC CGT GCC CTT TTA CCC GAC GCT ACC GAT GTG TGG GTG 144
33 GLEVRALLPDATDVWV 48

145 ATT GAA CCG AAA ACC GGG CGC AAA CTC GCA AAA CTG GAG TGT CTC GAC 192
49 IEPKTGRKLAKLECLD 64

193 TCA CGG GGA TTC TTT AGC GGC GTC ATT CCG CGA CGT AAG AAT TTT TTC 240
65 SRGFFSGVIPRRKNFF 80

241 CGC TAT CAG TTG GCT GTT GTC TGG CAT GGT CAG CAA AAC CTG ATT GAT 288
81 RYQLAVVWHGQQNLID 96

289 GAT CCT TAC CGT TTT GGT CCG CTA ATC CAG GAA ATG GAT GCC TGG CTA 336
97 DPYRFGPLIQEMDAWL 112

337 TTA TCT GAA GGT ACT CAC CTG CGC CCG TAT GAA ACC TTA GGC GCG CAT 384
113 LSEGTHLRPYETLGAH 128

385 GCA GAT ACT ATG GAT GGC GTC ACA GGT ACG CGT TTC TCT GTC TGG GCT 432
129 ADTMDGVTGTRFSVWA 144

433 CCA AAC GCC CGT CGG GTC TCG GTG GTT GGG CAA TTC AAC TAC TGG GAC 480
145 PNARRVSVVGQFNYWD 160

481 GGT CGC CGT CAC CCG ATG CGC CTG CGT AAA GAG AGC GGC ATC TGG GAA 528
161 GRRHPMRLRKESGIWE 176

529 CTG TTT ATC CCT GGG GCG CAT AAC GGT CAG CTC TAT AAA TAC GAG ATG 576
177 LFIPGAHNGQLYKYEM 192

577 ATT GAT GCC AAT GGC AAC TTG CGT CTG AAG TCC GAC CCT TAT GCC TTT 624
193 IDANGNLRLKSDPYAF 208

625 GAA GCG CAA ATG CGC CCG GAA ACC GCG TCT CTT ATT TGC GGG CTG CCG 672
209 EAQMRPETASLICGLP 224

673 GAA AAG GTT GTA CAG ACT GAA GAG CGC AAA AAA GCG AAT CAG TTT GAT 720
225 EKVVQTEERKKANQFD 240

721 GCG CCA ATC TCT ATT TAT GAA GTT CAC CTG GGT TCC TGG CGT CGC CAC 768
241 APISIYEVHLGSWRRH 256

769 ACC GAC AAC AAT TTC TGG TTG AGC TAC CGC GAG CTG GCC GAT CAA CTG 816
257 TDNNFWLSYRELADQL 272

817 GTG CCT TAT GCT AAA TGG ATG GGC TTT ACC CAC CTC GAA CTA CTG CCC 864
273 VPYAKWMGFTHLELLP 288

865 ATT AAC GAG CAT CCC TTC GAT GGC AGT TGG GGT TAT CAG CCA ACC GGC 912
289 INEHPFDGSWGYQPTG 304

913 CTG TAT GCG CCA ACC CGC CGT TTT GGT ACT CGC GAC GAC TTC CGT TAT 960
305 LYAPTRRFGTRDDFRY 320

961 TTC ATT GAT GCC GCA CAC GCA GCT GGT CTG AAC GTG ATT CTC GAC TGG 1008
321 FIDAAHAAGLNVILDW 336

1009 GTG CCA GGC CAC TTC CCG ACT GAT GAC TTT GCG CTT GCC GAA TTT GAT 1056
337 VPGHFPTDDFALAEFD 352

1057 GGC ACG AAC TTG TAT GAA CAC AGC GAT CCG CGT GAA GGC TAT CAT CAG 1104
353 GTNLYEHSDPREGYHQ 368

1105 GAC TGG AAC ACG CTG ATC TAC AAC TAT GGT CGC CGT GAA GTC AGT AAC 1152
369 DWNTLIYNYGRREVSN 384

1153 TTC CTC GTC GGT AAC GCG CTT TAC TGG ATT GAA CGT TTT GGT ATT GAT 1200
385 FLVGNALYWIERFGID 400

1201 GCG CTG CGC GTC GAT GCG GTG GCG TCA ATG ATT TAT CGC GAC TAC AGC 1248
401 ALRVDAVASMIYRDYS 416

1249 CGT AAA GAG GGG GAG TGG ATC CCG AAC GAA TTT GGC GGG CGC GAG AAT 1296
417 RKEGEWIPNEFGGREN 432

1297 CTT GAA GCG ATT GAA TTC TTG CGT AAT ACC AAC CGT ATT CTT GGT GAG 1344
433 LEAIEFLRNTNRILGE 448

1345 CAG GTT TCC GGT GCG GTG ACA ATG GCT GAG GAG TCT ACC GAT TTC CCT 1392
449 QVSGAVTMAEESTDFP 464

1393 GGC GTT TCT CGT CCG CAG GAT ATG GGC GGT CTG GGC TTC TGG TAC AAG 1440
465 GVSRPQDMGGLGFWYK 480

1441 TGG AAC CTC GGC TGG ATG CAT GAC ACC TTG GAC TAC ATG AAG CTC GAC 1488
481 WNLGWMHDTLDYMKLD 496

1489 CCG GTT TAT CGT CAG TAT CAT CAC GAT AAA CTG ACC TTC GGG ATT CTC 1536
497 PVYRQYHHDKLTFGIL 512

1537 TAC AAC TAC ACT GAA AAC TTC GTC CTG CCG TTG TCG CAT GAT GAA GTG 1584
513 YNYTENFVLPLSHDEV 528

1585 GTC CAC GGT AAA AAA TCG ATT CTC GAC CGC ATG CCG GGC GAC GCA TGG 1632
529 VHGKKSILDRMPGDAW 544

1633 CAG AAA TTC GCG AAC CTG CGC GCC TAC TAT GGC TGG ATG TGG GCA TTC 1680
545 QKFANLRAYYGWMWAF 560

1681 CCG GGC AAG AAA CTA CTG TTC ATG GGT AAC GAA TTT GCC CAG GGC CGC 1728
561 PGKKLLFMGNEFAQGR 576

1729 GAG TGG AAC CAT GAC GCC AGC CTC GAC TGG CAT CTG TTG GAA GGC GGC 1776
577 EWNHDASLDWHLLEGG 592

1777 GAT AAC TGG CAC CAC GGT GTC CAG CGT CTG GTG CGC GAT CTG AAC CTC 1824
593 DNWHHGVQRLVRDLNL 608

1825 ACC TAC CGC CAC CAT AAA GCA ATG CAT GAA CTG GAT TTT GAC CCG TAC 1872
609 TYRHHKAMHELDFDPY 624

1873 GGC TTT GAA TGG CTG GTG GTG GAT GAC AAA GAA CGC TCG GTG CTG ATC 1920
625 GFEWLVVDDKERSVLI 640

1921 TTT GTG CGT CGC GAT AAA GAG GGT AAC GAA ATC ATC GTT GCC AGT AAC 1968
641 FVRRDKEGNEIIVASN 656

1969 TTT ACG CCG GTA CCG CGT CAT GAT TAT CGC TTC GGC ATA AAC CAG CCG 2016
657 FTPVPRHDYRFGINQP 672

2017 GGC AAA TGG CGT GAA ATC CTC AAT ACC GAT TCC ATG CAC TAT CAC GGC 2064
673 GKWREILNTDSMHYHG 688

2065 AGT AAT GCA GGC AAT GGC GGC ACG GTA CAC AGC GAT GAG ATT GCC AGC 2112
689 SNAGNGGTVHSDEIAS 704

2113 CAC GGT CGT CAG CAT TCA CTA AGC CTG ACG CTA CCA CCG CTG GCC ACT 2160
705 HGRQHSLSLTLPPLAT 720

2161 ATC TGG CTG GTT CGG GAG GCA GAA TGA CAC AAC TCG CCA TTG GCA AAC 2208
721 IWLVREAE*HNSPLAN 736

2209 CCG CTC CCC TCG GCG CGC ATT ACG ACG GTC AGG GCG TCA ACT TCA CAC 2256
737 PLPSARITTVRASTSH 752

2257 TTT TCT CCG CTC ATG CCG AGC GGG TAG AAC TGT GTG TCT TTG ACG CCA 2304
753 FSPLMPSG*NCVSLTP 768

2305 ATG GCC AGG AAC ATC GCT ATG ACT TGC CAG GGC ACA GTG GCG ACA TTT 2352
769 MARNIAMTCQGTVATF 784

2353 GGC ACG GTT 2361
785 GTV 787
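
In the ECGLGB.SEQ listing the translation region runs to the end of the file (1 - 2361), so the end of the open reading frame has to be read off from the first `*` in the amino-acid rows. The following is a minimal sketch of locating that first in-frame stop codon. The function name and its `first_codon_number` parameter are hypothetical, and the only data used is the row of the listing that begins at nucleotide 2161 (codon 721).

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stops in the universal codon table


def first_stop(dna, first_codon_number=1):
    """Locate the first in-frame stop codon.

    `dna` is read in frame 1; `first_codon_number` is the listing number of
    its first codon. Returns (codon_number, first_nucleotide_number), both
    1-based in the numbering of the full sequence, or None if there is no stop.
    """
    seq = "".join(dna.upper().split())
    for offset in range(0, len(seq) - len(seq) % 3, 3):
        if seq[offset:offset + 3] in STOP_CODONS:
            codon_number = first_codon_number + offset // 3
            return codon_number, 3 * (codon_number - 1) + 1
    return None


# Row of the ECGLGB.SEQ listing starting at nucleotide 2161 (codon 721).
ROW_2161 = "ATC TGG CTG GTT CGG GAG GCA GAA TGA CAC AAC TCG CCA TTG GCA AAC"

if __name__ == "__main__":
    codon_number, nt_start = first_stop(ROW_2161, first_codon_number=721)
    print(codon_number, nt_start)       # stop codon number and its first nucleotide
    print(codon_number - 1)             # residues encoded before the stop
```

Under this reading, the residues printed after the first `*` in the listing belong to sequence downstream of the open reading frame rather than to the encoded protein.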
LOCUS       MZEZEIN10K   2562 bp    DNA    PLN    19-JUL-1994
DEFINITION  Zea mays 10-kDa zein gene, complete cds.
ACCESSION   M23537
NID         g340933
KEYWORDS    methionine-rich protein; seed storage protein; zein protein.
SOURCE      Zea mays 3-week old seedling leaf DNA.
  ORGANISM  Zea mays
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
            Commelinidae; Cyperales; Poaceae.
REFERENCE   1 (bases 1 to 2562)
  AUTHORS   Kirihara,J.A., Petri,J.B. and Messing,J.
  TITLE     Isolation and sequence of a gene encoding a methionine-rich 10-kDa
            zein protein from maize
  JOURNAL   Gene 71, 359-370 (1988)
  MEDLINE   89138012
COMMENT     NCBI gi: 340933
FEATURES             Location/Qualifiers
     source          1..2562
                     /organism="Zea mays"
                     /cell_line="inbred BSSS-53"
                     /dev_stage="3-week old seedling"
                     /sequenced_mol="DNA"
                     /tissue_type="leaf"
     TATA_signal     1044..1050
     mRNA            1082..1685
     CDS             1137..1589
                     /note="10 kDa; NCBI gi: 511870"
                     /codon_start=1
                     /product="zein protein"
                     /db_xref="PID:g511870"
                     /translation="MAAKMLALFALLALCASATSATHIPGHLPPVMPLGTMNPCMQYC
                     MMQQGLASLMACPSLMLQQLLALPLQTMPVMMPQMMTPNMMSPLMMPSMMSPMVLPSM
                     MSQIMMPQCHCDAVSQIMLQQQLPFMFNPMAMTIPPMFLQQPFVGAAF"
     sig_peptide     1137..1199
                     /codon_start=1
     mat_peptide     1200..1586
                     /note="10 kDa"
                     /codon_start=1
                     /product="zein protein"
     polyA_signal    1655..1660
BASE COUNT      782 a    506 c    471 g    803 t
ORIGIN

MZEZEIN10K Length: 2562 May 29, 1996 15:41 Type: N Check: 8408 ..

1 AAGCTTGCTA CTTTCTTTCC TTAATGTTGA TTTCCCCTTT GTTAGATGTT
51 CTTTGTGTTA TATACACTCT GTATACAAGG ATGCGATACA CACATCAGCT
101 AGTCCTAATG ATGCCACCGA CTTTACTTGA GGAAAAGGAA ACAAATATGA
151 TGTGGCCATC ACATTCTCAA TAACAATGAC CATGTGCGCA ATGACATACC
201 ATCATATTTG ATATCATAAA AATAAATTTA TTATCAAAGT AAACATATAG
251 TTCATATATC AGATATTAAA GTGATAAGAA CAAATATTAC ATTTTATCTT
301 ATATAAAATG ACGAAAAAGG TACGAGTTGA AAAGGAGTCC AACCCCTTTT
351 TTATAGCTTG TTCGGTTGCT TGTTCTCTTC GGCTAGCGAG GTGGTAGAAT
401 GTGAGAGTGT TGCGCGTGGA TTCCCGTCGT AGTGTTCTTA GGTGATTTCT
451 CACGGCCCAT CTGTGATATA GCGACTCATA TGTGGTGTAA TAGCCCATTG
501 GGAGAAGGGG AGAGATATAG ATCTACGTGA TTTGCACGTG ATGCACGACG
551 AACGAAACTG GTGGTTTAAA GTAGTAGAGG TTTGTCATTA GAGGTGTAAA
601 TGGTACATAT ATTATCCGTT CATATTCGAA TTTGATCCGT ATAAGAGGGC
651 TAAGATCTAA TCCGTATACA AGTCCAAGTA TTAAGTATCC GATCCATATC
701 GGATCTTTAT CCGTATCCGT ATACTCAAAA TTTGATGTTT AAGATTTTAA
751 TATATATTTA AACTTTATAG GAACTCGATA ATATTTGTAT CTGATTTGAA
801 TTATGAAAAC AAATATGGAA CGATTAATTT CAGTCTATAT CCGTTCCGAT
851 ATTTGTCATG CTTTGCTAAA AATACCTTTA CAAGGCATCT TGTGCAGATT
901 ATATATTAAT CTGAAATCAG TTAGAGAAGC CTACAAATTT GACCAAATGC
951 CGAGTCATCC GGCTTATCCC CTTTCCAACT TTCAGTTCTG CAAGCGCCAG
1001 AAATCGTTTT TCATCTACAT TGTCTTTGTT GCCTGCATAC ATCTATAAAT
1051 AGGACCTGCT AGATCAATCG CAGTCCATCG GCCTCAGTCG CACATATCTA
1101 CTATACTATA CTCTAGGAAG CAAGGACACC ACCGCCATGG CAGCCAAGAT
1151 GCTTGCATTG TTCGCTCTCC TAGCTCTTTG TGCAAGCGCC ACTAGTGCGA
1201 CCCATATTCC AGGGCACTTG CCACCAGTCA TGCCATTGGG TACCATGAAC
1251 CCATGCATGC AGTACTGCAT GATGCAACAG GGGCTTGCCA GCTTGATGGC
1301 GTGTCCGTCC CTGATGCTGC AGCAACTGTT GGCCTTACCG CTTCAGACGA
1351 TGCCAGTGAT GATGCCACAG ATGATGACGC CTAACATGAT GTCACCATTG
1401 ATGATGCCGA GCATGATGTC ACCAATGGTC TTGCCGAGCA TGATGTCGCA
1451 AATAATGATG CCACAATGTC ACTGCGACGC CGTCTCGCAG ATTATGCTGC
1501 AACAGCAGTT ACCATTCATG TTCAACCCAA TGGCCATGAC GATTCCACCC
1551 ATGTTCTTAC AGCAACCCTT TGTTGGTGCT GCATTCTAGA TAGAAATATT
1601 TGTGTTGTAT CGAATAATGA GTTGACATGC CATCGCGTGT GACTCATTAT
1651 TAACAATAAA ACAAGTTTCC TCTTATTATC TTTTTATATC TCTCCCTATC
1701 CATTTTTGCA AAGCCCATTA TCCTTTACTC CCTAAGTCCC AATATATTTT
1751 AGACCTTAAA TTGTATGTCT ATATTCAAAA GAATGACAAT AAATCTAGAC
1801 ATATATATAA AACACATACA TTAAGTATTG TATGAATCTA TTAAAATGCT
1851 AAAACGACTA ATATTATGGG ACGGAGGGAG TACTTTATTA GTAGATTACA
1901 TTGTTATTTT CTCTATTCCA AATATAAGTC TGGTTTTTCA ATCAATCAAT
1951 ATATATTACC ATGTCCAAAC ATTTTGAATT ATATATCTAG GTGCAGCATC
2001 CGTGCACGAT CGTAAAAGAA GCAGTCACGG TGTTGGTCCC AAAAACTAAT
2051 CGTCCGTTGT CGGTCACCTA TAAAGATTCA TGAAGAGAAC CAAAATAAGG
2101 CAATATAATT AATGTAATAT GACTCCTCCT TTTGAATTAC TTAGGAATAA
2151 CATAAGCAAA CAAAAAAAGG AGAAGATCAA GGTAAATAAA GGCATTTTGT
2201 GAGAAAACAT GGAAGCATAA GAATGCATAA GTAATGATTT GTGTCTCTTT
2251 ATATTTTTTT TATTCACGTG AATTTACATA GATACCATCG GATGTTCGAT
2301 GGTAATACAA TGATGCCTTA GCTCCGAGAG CTTCGAATGA TGAGCGATTT
2351 AAAAATACTC CTATCAATTG TTCGAAAGTT [illegible] TGCATGGGCA
2401 ATGTACCTCT ATTTATAGGG ACGGTGCGAC GTACAAATTT GTATAAAATT
2451 ATATTTTTAT TCCCAAATCC TATGCATATG TGTCGGGGAC CATAATTAGG
2501 GGTACCCTCA AGGCTCCTAA TTCTCAGCTG GTAACCCCAT CAGCATAAAG
2551 CTGCAAAGGC CT

Fig. 45b

LOCUS       MZEZEIN10K   2562 bp    DNA    PLN    19-JUL-1994
DEFINITION  Zea mays 10-kDa zein gene, complete cds.
ACCESSION   M23537
NID         g340933
KEYWORDS    methionine-rich protein; seed storage protein; zein protein.
SOURCE      Zea mays 3-week old seedling leaf DNA.
  ORGANISM  Zea mays
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
            Commelinidae; Cyperales; Poaceae.
REFERENCE   1 (bases 1 to 2562)
  AUTHORS   Kirihara,J.A., Petri,J.B. and Messing,J.
  TITLE     Isolation and sequence of a gene encoding a methionine-rich 10-kDa
            zein protein from maize
  JOURNAL   Gene 71, 359-370 (1988)
  MEDLINE   89138012
COMMENT     NCBI gi: 340933
FEATURES             Location/Qualifiers
     source          1..2562
                     /organism="Zea mays"
                     /cell_line="inbred BSSS-53"
                     /dev_stage="3-week old seedling"
                     /sequenced_mol="DNA"
                     /tissue_type="leaf"
     TATA_signal     1044..1050
     mRNA            1082..1685
     CDS             1137..1589
                     /note="10 kDa; NCBI gi: 511870"
                     /codon_start=1
                     /product="zein protein"
                     /db_xref="PID:g511870"
                     /translation="MAAKMLALFALLALCASATSATHIPGHLPPVMPLGTMNPCMQYC
                     MMQQGLASLMACPSLMLQQLLALPLQTMPVMMPQMMTPNMMSPLMMPSMMSPMVLPSM
                     MSQIMMPQCHCDAVSQIMLQQQLPFMFNPMAMTIPPMFLQQPFVGAAF"
     sig_peptide     1137..1199
                     /codon_start=1
     mat_peptide     1200..1586
                     /note="10 kDa"
                     /codon_start=1
                     /product="zein protein"
     polyA_signal    1655..1660
BASE COUNT      782 a    506 c    471 g    803 t
ORIGIN

Fig. 45b (continued)

MZEZEIN10K Length: 2562 May 29, 1996 15:41 Type: N Check: 8408 ..

1 AAGCTTGCTA CTTTCTTTCC TTAATGTTGA TTTCCCCTTT GTTAGATGTT
51 CTTTGTGTTA TATACACTCT GTATACAAGG ATGCGATACA CACATCAGCT
101 AGTCCTAATG ATGCCACCGA CTTTACTTGA GGAAAAGGAA ACAAATATGA
151 TGTGGCCATC ACATTCTCAA TAACAATGAC CATGTGCGCA ATGACATACC
201 ATCATATTTG ATATCATAAA AATAAATTTA TTATCAAAGT AAACATATAG
251 TTCATATATC AGATATTAAA GTGATAAGAA CAAATATTAC ATTTTATCTT
301 ATATAAAATG ACGAAAAAGG TACGAGTTGA AAAGGAGTCC AACCCCTTTT
351 TTATAGCTTG TTCGGTTGCT TGTTCTCTTC GGCTAGCGAG GTGGTAGAAT
401 GTGAGAGTGT TGCGCGTGGA TTCCCGTCGT AGTGTTCTTA GGTGATTTCT
451 CACGGCCCAT CTGTGATATA GCGACTCATA TGTGGTGTAA TAGCCCATTG
501 GGAGAAGGGG AGAGATATAG ATCTACGTGA TTTGCACGTG ATGCACGACG
551 AACGAAACTG GTGGTTTAAA GTAGTAGAGG TTTGTCATTA GAGGTGTAAA
601 TGGTACATAT ATTATCCGTT CATATTCGAA TTTGATCCGT ATAAGAGGGC
651 TAAGATCTAA TCCGTATACA AGTCCAAGTA TTAAGTATCC GATCCATATC
701 GGATCTTTAT CCGTATCCGT ATACTCAAAA TTTGATGTTT AAGATTTTAA
751 TATATATTTA AACTTTATAG GAACTCGATA ATATTTGTAT CTGATTTGAA
801 TTATGAAAAC AAATATGGAA CGATTAATTT CAGTCTATAT CCGTTCCGAT
851 ATTTGTCATG CTTTGCTAAA AATACCTTTA CAAGGCATCT TGTGCAGATT
901 ATATATTAAT CTGAAATCAG TTAGAGAAGC CTACAAATTT GACCAAATGC
951 CGAGTCATCC GGCTTATCCC CTTTCCAACT TTCAGTTCTG CAAGCGCCAG
1001 AAATCGTTTT TCATCTACAT TGTCTTTGTT GCCTGCATAC ATCTATAAAT
1051 AGGACCTGCT AGATCAATCG CAGTCCATCG GCCTCAGTCG CACATATCTA
1101 CTATACTATA CTCTAGGAAG CAAGGACACC ACCGCCATG
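
The feature table of the MZEZEIN10K record is internally consistent, and that consistency can be verified with a few lines of arithmetic. The following is a minimal sketch of such a check. The coordinates and base counts are copied from the record; the variable and function names are hypothetical.

```python
# Feature coordinates (1-based, inclusive) copied from the MZEZEIN10K record.
FEATURES = {
    "CDS": (1137, 1589),
    "sig_peptide": (1137, 1199),
    "mat_peptide": (1200, 1586),
}
BASE_COUNT = {"a": 782, "c": 506, "g": 471, "t": 803}
RECORD_LENGTH = 2562


def span_length(start, end):
    """Number of nucleotides in an inclusive 1-based interval."""
    return end - start + 1


def summarize(features):
    """Map each feature name to (nucleotides, whole codons)."""
    return {
        name: (span_length(*span), span_length(*span) // 3)
        for name, span in features.items()
    }


if __name__ == "__main__":
    summary = summarize(FEATURES)
    print(summary)
    # The CDS includes the stop codon; the two peptide features do not.
    cds_nt = summary["CDS"][0]
    peptides_nt = summary["sig_peptide"][0] + summary["mat_peptide"][0]
    print(cds_nt - peptides_nt)          # nucleotides left over for the stop codon
    print(sum(BASE_COUNT.values()) == RECORD_LENGTH)
```

The signal peptide and mature peptide together account for every codon of the CDS except the final stop codon, and the four base counts sum to the stated record length.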



WO 98/44780 PCT/US98/06660

DNASIS ***** DNA TRANSLATION LIST ***** DATE 04-03-97

*** INPUT INFORMATION ***

FILE NAME : ECGLGC3.SEQ SEQUENCE : NORMAL 1328 BP

CODON TABLE : UNIV.TCN

SEQUENCE REGION : 1 - 1328

TRANSLATION REGION : 1 - 1326

*** DNA TRANSLATION ***

1 ATG GTT AGT TTA GAG AAG AAC GAT CAC TTA ATG TTG GCG CGC CAG CTG 48
1 MVSLEKNDHLMLARQL 16

49 CCA TTG AAA TCT GTT GCC CTG ATA CTG GCG GGA GGA CGT GGT ACC CGC 96
17 PLKSVALILAGGRGTR 32

97 CTG AAG GAT TTA ACC AAT AAG CGA GCA AAA CCG GCC GTA CAC TTC GGC 144
33 LKDLTNKRAKPAVHFG 48

145 GGT AAG TTC CGC ATT ATC GAC TTT GCG CTG TCT AAC TGC ATC AAC TCC 192
49 GKFRIIDFALSNCINS 64

193 GGG ATC CGT CGT ATG GGC GTG ATC ACC CAG TAC CAG TCC CAC ACT CTG 240
65 GIRRMGVITQYQSHTL 80

241 GTG CAG CAC ATT CAG CGC GGC TGG TCA TTC TTC AAT GAA GAA ATG AAC 288
81 VQHIQRGWSFFNEEMN 96

289 GAG TTT GTC GAT CTG CTG CCA GCA CAG CAG AGA ATG AAA GGG GAA AAC 336
97 EFVDLLPAQQRMKGEN 112

337 TGG TAT CGC GGC ACC GCA GAT GCG GTC ACC CAA AAC CTC GAC ATT ATC 384
113 WYRGTADAVTQNLDII 128

385 CGT CGT TAT AAA GCG GAA TAC GTG GTG ATC CTG GCG GGC GAC CAT ATC 432
129 RRYKAEYVVILAGDHI 144

433 TAC AAG CAA GAC TAC TCG CGT ATG CTT ATC GAT CAC GTC GAA AAA GGT 480
145 YKQDYSRMLIDHVEKG 160

481 GTA CGT TGT ACC GTT GTT TGT ATG CCA GTA CCG ATT GAA GAA GCC TCC 528
161 VRCTVVCMPVPIEEAS 176

529 GCA TTT GGC GTT ATG GCG GTT GAT GAG AAC GAT AAA ACT ATC GAA TTC 576
177 AFGVMAVDENDKTIEF 192

577 GTG GAA AAA CCT GCT AAC CCG CCG TCA ATG CCG AAC GAT CCG AGC AAA 624
193 VEKPANPPSMPNDPSK 208

625 TCT CTG GCG AGT ATG GGT ATC TAC GTC TTT GAC GCC GAC TAT CTG TAT 672
209 SLASMGIYVFDADYLY 224

673 GAA CTG CTG GAA GAA GAC GAT CGC GAT GAG AAC TCC AGC CAC GAC TTT 720
225 ELLEEDDRDENSSHDF 240

721 GGC AAA GAT TTG ATT CCC AAG ATC ACC GAA GCC GGT CTG GCC TAT GCG 768
241 GKDLIPKITEAGLAYA 256

769 CAC CCG TTC CCG CTC TCT TGC GTA CAA TCC GAC CCG GAT GCC GAG CCG 816
257 HPFPLSCVQSDPDAEP 272

817 TAC TGG CGC GAT GTG GGT ACG CTG GAA GCT TAC TGG AAA GCG AAC CTC 864
273 YWRDVGTLEAYWKANL 288

865 GAT CTG GCC TCT GTG GTG GAC AAA CTG GAT ATG TAC GAT CGC AAT TGG 912
289 DLASVVDKLDMYDRNW 304

913 CCA ATT CGC ACC TAC AAT GAA TCA TTA CCG CCA GCG AAA TTC GTG CAG 960
305 PIRTYNESLPPAKFVQ 320

961 GAT CGC TCC GGT AGC CAC GGG ATG ACC CTT AAC TCA CTG GTT TCC GGC 1008
321 DRSGSHGMTLNSLVSG 336

1009 GGT TGT GTG ATC TCC GGT TCG GTG GTG GTG CAG TCC GTT CTG TTC TCG 1056
337 GCVISGSVVVQSVLFS 352

1057 CGC GTT CGC GTG AAT TCA TTC TGC AAC ATT GAT TCC GCC GTA TTG TTA 1104
353 RVRVNSFCNIDSAVLL 368

1105 CCG GAA GTA TGG GTA GGT CGC TCG TGC CGT CTG CGC CGC TGC GTC ATC 1152
369 PEVWVGRSCRLRRCVI 384

1153 GAT CGT GCT TGT GTT ATT CCG GAA GGC ATG GTG ATT GGT GAA AAC GCA 1200
385 DRACVIPEGMVIGENA 400

1201 GAG GAA GAT GCA CGT CGT TTC TAT CGT TCA GAA GAA GGC ATC GTG CTG 1248
401 EEDARRFYRSEEGIVL 416

1249 GTA ACG CGC GAA ATG CTA CGG AAG TTA GGG CAT AAA CAG GAG CGA TAA 1296
417 VTREMLRKLGHKQER* 432

1297 TGC AGG TTT TAC ATG TAT GTT CAG AGA TGT TT 1328
433 CRFYMYVQRC 442

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DNASIS ***** DNA TRANSLATION LIST ***** \J DATE 04-03-97 

*** INPUT INFORMATION *** 

FILE NAME : ECGLGC.DNA SEQUENCE : NORMAL 1328 BP 

CODON TABLE : UNIV.TCN 

SEQUENCE REGION : 1 - 1328 

TRANSLATION REGION : 1 - 1326 

*** DNA TRANSLATION *** 



1 

1 


ATG 
M 


GTT 
V 


AGT 

s 


TTA 

T, 


GAG 

Hi 


AAG 

zr 
t\ 


AAC 


GAT 

U 


CAC 

XT 

xi 


TTA 

T 

ll 


ATG 
M 


TTG 
L 


GCG 
A 


CGC 
R 


CAG 
Q 


CTG 
L 


4F 
16 


49 
17 


CCA 
p 


TTG 
L 


AAA 


TCT 

e 
o 


GTT 

V 


GCC 

A 


CTG 

T 


ATA 

± 


CTG 

T 


GCG 
A 


GGA 
G 


GGA 
G 


CGT 
R 


GGT 
G 


ACC 
T 


CGC 
R 


96 

32 


97 
33 


CTG 
L 


AAG 
K 


GAT 
D 


TTA 
L 


ACC 
T 


AAT 
N 


AAG 
K 


CGA 
R 


GCA 
A 


AAA 
K 


CCG 
p 


GCC 
A 


GTA 

v 


CAC 
H 


TTC 
F 


GGC 

(Z 


144 


145 
49 


GGT 
G 


AAG 
K 


TTC 
F 


CGC 
R 


ATT 
I 


ATC 
I 


GAC 
D 


TTT 
F 


GCG 
A 


CTG 
L 


TCT 
S 


AAC 
N 


TGC 
C 


ATC 
I 


AAC 
N 


TCC 
S 


192 
64 


193 
65 


GGG 
G 


ATC 
I 


CGT 
R 


CGT 
R 


ATG 
M 


GGC 
G 


GTG 
V 


ATC 
I 


ACC 
T 


CAG 
Q 


TAC 
Y 


CAG 
Q 


TCC 

s 


CAC 
H 


ACT 
T 


CTG 
L 


24C 
8C 


241 
81 


GTG 
V 


CAG 
Q 


CAC 
H 


ATT 
I 


CAG 
Q 


CGC 
R 


GGC 
G 


TGG 
W 


TCA 
S 


TTC 
F 


TTC 
F 


AAT 
N 


GAA 
E 


GAA 
E 


ATG 
M 


AAC 
N 


28£ 
96 


289 
97 


GAG 
E 


TTT 
F 


GTC 
V 


GAT 
D 


CTG 
L 


CTG 
L 


CCA 
P 


GCA 
A 


CAG 
Q 


CAG 
Q 


AGA 
R 


ATG 
M 


AAA 
K 


GGG 
G 


GAA 
E 


AAC 
N 


336 
112 


337 
113 


TGG 
W 


TAT 
Y 


CGC 
R 


GGC 
G 


ACC 
T 


GCA 
A 


GAT 
D 


GCG 
A 


GTC 
V 


ACC 
T 


CAA 
Q 


AAC 
N 


CTC 
L 


GAC 
D 


ATT 
I 


ATC 
I 


384 
128 


385 
129 


CGT 
R 


CGT 
R 


TAT 
Y 


AAA 
K 


GCG 
A 


GAA 
E 


TAC 

Y 


GTG 
V 


GTG 
V 


ATC 
I 


CTG 
L 


GCG 
A 


GGC 
G 


GAC 
D 


CAT 
H 


ATC 
I 


432 
144 


433 
145 


TAC 
Y 


AAG 
K 


CAA 
Q 


GAC 
D 


TAC 
Y 


TCG 
S 


CGT 
R 


ATG 
M 


CTT 
L 


ATC 
I 


GAT 
D 


CAC 
H 


GTC 
V 


GAA 

E 


AAA 
K 


GGT 
G 


480 
16C 


481 
161 


GTA 
V 


CGT 
R 


TGT 
C 


ACC 
T 


GTT 
V 


GTT 
V 


TGT 
C 


ATG 
M 


CCA 
P 


GTA 
V 


CCG 
P 


ATT 
I 


GAA 
E 


GAA 
E 


GCC 
A 


TCC 
S 


528 
176 


529 
177 


GCA 
A 


TTT 
F 


GGC 
G 


GTT 
V 


ATG 
M 


GCG 
A 


GTT 
V 


GAT 
D 


GAG 
E 


AAC 
N 


GAT 
D 


AAA 
K 


ACT 
T 


ATC 
I 


GAA 
E 


TTC 
F 


576 
192 


577 
193 


GTG 
V 


GAA 
E 


AAA 
K 


CCT 
P 


GCT 
A 


AAC 
N 


CCG 
P 


CCG 
P 


TCA 
S 


ATG 
M 


CCG 
P 


AAC 
N 


GAT 
D 


CCG 
P 


AGC 
S 


AAA 

K 


624 
208 


625 


TCT 

s 


CTG 


GCG 


AGT 

S 


ATG 

M 


GGT 


ATC 
T 


TAC 

Y 


GTC 
v/ 


TTT 

F 


GAC 


GCC 
A 


GAC 


TAT 

Y 


CTG 


TAT 

V 


672 

22>j 



BNSDOC1D: <WO 9844780A1_L> 



WO 98/44780 PCT/US98/06660 

673 GAA CTG CTG GAA GAA GAC GAT CGC GAT GAG AAC TCC AGC CAC GAC TTT 720
225 ELLEEDDRDENSSHDF 240

721 GGC AAA GAT TTG ATT CCC AAG ATC ACC GAA GCC GGT CTG GCC TAT GCG 768
241 GKDLIPKITEAGLAYA 256

769 CAC CCG TTC CCG CTC TCT TGC GTA CAA TCC GAC CCG GAT GCC GAG CCG 816
257 HPFPLSCVQSDPDAEP 272

817 TAC TGG CGC GAT GTG GGT ACG CTG GAA GCT TAC TGG AAA GCG AAC CTC 864
273 YWRDVGTLEAYWKANL 288

865 GAT CTG GCC TCT GTG GTG CCG AAA CTG GAT ATG TAC GAT CGC AAT TGG 912
289 DLASVVPKLDMYDRNW 304

913 CCA ATT CGC ACC TAC AAT GAA TCA TTA CCG CCA GCG AAA TTC GTG CAG 960
305 PIRTYNESLPPAKFVQ 320

961 GAT CGC TCC GGT AGC CAC GGG ATG ACC CTT AAC TCA CTG GTT TCC GGC 1008
321 DRSGSHGMTLNSLVSG 336

1009 GGT TGT GTG ATC TCC GGT TCG GTG GTG GTG CAG TCC GTT CTG TTC TCG 1056
337 GCVISGSVVVQSVLFS 352

1057 CGC GTT CGC GTG AAT TCA TTC TGC AAC ATT GAT TCC GCC GTA TTG TTA 1104
353 RVRVNSFCNIDSAVLL 368

1105 CCG GAA GTA TGG GTA GGT CGC TCG TGC CGT CTG CGC CGC TGC GTC ATC 1152
369 PEVWVGRSCRLRRCVI 384

1153 GAT CGT GCT TGT GTT ATT CCG GAA GGC ATG GTG ATT GGT GAA AAC GCA 1200
385 DRACVIPEGMVIGENA 400

1201 GAG GAA GAT GCA CGT CGT TTC TAT CGT TCA GAA GAA GGC ATC GTG CTG 1248
401 EEDARRFYRSEEGIVL 416

1249 GTA ACG CGC GAA ATG CTA CGG AAG TTA GGG CAT AAA CAG GAG CGA TAA 1296
417 VTREMLRKLGHKQER* 432

1297 TGC AGG TTT TAC ATG TAT GTT CAG AGA TGT TT
433 CRFYMYVQRC 442
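The row labels in a listing of this kind follow a fixed pattern, which helps when checking digits damaged in reproduction: each full row holds 16 codons, so nucleotide labels advance by 48 and residue labels by 16. A minimal Python sketch of that arithmetic follows. The function name `row_labels` is hypothetical, and the layout is inferred from the listing itself rather than taken from DNASIS documentation.

```python
def row_labels(translation_end, codons_per_row=16):
    """Yield (nt_start, nt_end, aa_start, aa_end) for each listing row.

    Assumes translation starts at nucleotide 1 and that every row except
    possibly the last holds `codons_per_row` complete codons.
    """
    total_codons = translation_end // 3
    for first in range(0, total_codons, codons_per_row):
        last = min(first + codons_per_row, total_codons)
        yield (3 * first + 1, 3 * last, first + 1, last)


# Translation region 1 - 1326, as stated for the 1328 bp glgC listings.
rows = list(row_labels(1326))
print(rows[14])   # (673, 720, 225, 240)
print(rows[-1])   # (1297, 1326, 433, 442)
```

Under these assumptions the final row covers residues 433 to 442, which agrees with the last residue label printed in the listing.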



DNASIS ***** DNA TRANSLATION LIST ***** DATE 04-03-97

*** INPUT INFORMATION ***
FILE NAME : ECGLGCWT.SEQ SEQUENCE : NORMAL 1328 BP
CODON TABLE : UNIV.TCN 

SEQUENCE REGION : 1 - 1328 

TRANSLATION REGION : 1 - 1326 

*** DNA TRANSLATION *** 

1 ATG GTT AGT TTA GAG AAG AAC GA1 CAC TTA ATG TTG GCG CGC CAG CTG 48
1 MVSLEKNDHLMLARQL 16

49 CCA TTG AAA TCT GTT GCC CTG AT; CTG GCG GGA GGA CGT GGT ACC CGC 96
17 PLKSVALILAGGRGTR 32

97 CTG AAG GAT TTA ACC AAT AAG CG7 GCA AAA CCG GCC GTA CAC TTC GGC 144
33 LKDLTNKRAKPAVHFG 48

145 GGT AAG TTC CGC ATT ATC GAC TT1 GCG CTG TCT AAC TGC ATC AAC TCC 192
49 GKFRIIDFALSNCINS 64

193 GGG ATC CGT CGT ATG GGC GTG ATC ACC CAG TAC CAG TCC CAC ACT CTG 240
65 GIRRMGVITQYQSHTL 80

241 GTG CAG CAC ATT CAG CGC GGC TGG TCA TTC TTC AAT GAA GAA ATG AAC 288
81 VQHIQRGWSFFNEEMN 96

289 GAG TTT GTC GAT CTG CTG CCA GCA CAG CAG AGA ATG AAA GGG GAA AAC 336
97 EFVDLLPAQQRMKGEN 112

337 TGG TAT CGC GGC ACC GCA GAT GCG GTC ACC CAA AAC CTC GAC ATT ATC 384
113 WYRGTADAVTQNLDII 128

385 CGT CGT TAT AAA GCG GAA TAC GTG GTG ATC CTG GCG GGC GAC CAT ATC 432
129 RRYKAEYVVILAGDHI 144

433 TAC AAG CAA GAC TAC TCG CGT ATG CTT ATC GAT CAC GTC GAA AAA GGT 480
145 YKQDYSRMLIDHVEKG 160

481 GTA CGT TGT ACC GTT GTT TGT ATG CCA GTA CCG ATT GAA GAA GCC TCC 528
161 VRCTVVCMPVPIEEAS 176

529 GCA TTT GGC GTT ATG GCG GTT GAT GAG AAC GAT AAA ACT ATC GAA TTC 576
177 AFGVMAVDENDKTIEF 192

577 GTG GAA AAA CCT GCT AAC CCG CCG TCA ATG CCG AAC GAT CCG AGC AAA 624
193 VEKPANPPSMPNDPSK 208

625 TCT CTG GCG AGT ATG GGT ATC TAC GTC TTT GAC GCC GAC TAT CTG TAT 672
209 SLASMGIYVFDADYLY 224



673 GAA CTG CTG GAA GAA GAC GAT CGC GAT GAG AAC TCC AGC CAC GAC TTT 720
225 ELLEEDDRDENSSHDF 240

721 GGC AAA GAT TTG ATT CCC AAG ATC ACC GAA GCC GGT CTG GCC TAT GCG 768
241 GKDLIPKITEAGLAYA 256

769 CAC CCG TTC CCG CTC TCT TGC GTA CAA TCC GAC CCG GAT GCC GAG CCG 816
257 HPFPLSCVQSDPDAEP 272

817 TAC TGG CGC GAT GTG GGT ACG CTG GAA GCT TAC TGG AAA GCG AAC CTC 864
273 YWRDVGTLEAYWKANL 288

865 GAT CTG GCC TCT GTG GTG CCG GAA CTG GAT ATG TAC GAT CGC AAT TGG 912
289 DLASVVPELDMYDRNW 304

913 CCA ATT CGC ACC TAC AAT GAA TCA TTA CCG CCA GCG AAA TTC GTG CAG 960
305 PIRTYNESLPPAKFVQ 320

961 GAT CGC TCC GGT AGC CAC GGG ATG ACC CTT AAC TCA CTG GTT TCC GGC 1008
321 DRSGSHGMTLNSLVSG 336

1009 GGT TGT GTG ATC TCC GGT TCG GTG GTG GTG CAG TCC GTT CTG TTC TCG 1056
337 GCVISGSVVVQSVLFS 352

1057 CGC GTT CGC GTG AAT TCA TTC TGC AAC ATT GAT TCC GCC GTA TTG TTA 1104
353 RVRVNSFCNIDSAVLL 368

1105 CCG GAA GTA TGG GTA GGT CGC TCG TGC CGT CTG CGC CGC TGC GTC ATC 1152
369 PEVWVGRSCRLRRCVI 384

1153 GAT CGT GCT TGT GTT ATT CCG GAA GGC ATG GTG ATT GGT GAA AAC GCA 1200
385 DRACVIPEGMVIGENA 400

1201 GAG GAA GAT GCA CGT CGT TTC TAT CGT TCA GAA GAA GGC ATC GTG CTG 1248
401 EEDARRFYRSEEGIVL 416

1249 GTA ACG CGC GAA ATG CTA CGG AAG TTA GGG CAT AAA CAG GAG CGA TAA 1296
417 VTREMLRKLGHKQER* 432

1297 TGC AGG TTT TAC ATG TAT GTT CAG AGA TGT TT
433 CRFYMYVQRC 442
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Q 


GAA ATC ATG AAG CGC ATG ATT TTG 864
EIMKRMIL 288

865 TTT TGC AAG GTT GCT GTT GAG GTT CCT TGG CAC GTT CCA TGC GGT GGT 912
289 FCKVAVEVPWHVPCGG 304

913 GTG TGC TAC GGA GAT GGA AAT TTG GTG TTC ATT GCC ATG AAT TGG CAC 960
305 VCYGDGNLVFIAMNWH 320

961 ACT GCA CTC CTG CCT GTT TAT CTG AAG GCA TAT TAC AGA GAC CAT GGG 1008
321 TALLPVYLKAYYRDHG 336

1009 TTA ATG CAG TAC ACT CGC TCC GTC CTC GTC ATA CAT AAC ATC GGC CAC 1056
337 LMQYTRSVLVIHNIGH 352

1057 CAG GGC CGT GGT CCT GTA CAT GAA TTC CCG TAC ATG GAC TTG CTG AAC 1104
353 QGRGPVHEFPYMDLLN 368

1105 ACT AAC CTT CAA CAT TTC GAG CTG TAC GAT CCC GTC GGT GGC GAG CAC 1152
369 TNLQHFELYDPVGGEH 384

1153 GCC AAC ATC TTT GCC GCG TGT GTT CTG AAG ATG GCA GAC CGG GTG GTG 1200
385 ANIFAACVLKMADRVV 400

1201 ACT GTC AGC CGC GGC TAC CTG TGG GAG CTG AAG ACA GTG GAA GGC GGC 1248
401 TVSRGYLWELKTVEGG 416

1249 TGG GGC CTC CAC GAC ATC ATC CGT TCT AAC GAC TGG AAG ATC AAT GGC 1296
417 WGLHDIIRSNDWKING 432

1297 ATT CGT GAA CGC ATC GAC CAC CAG GAG TGG AAC CCC AAG GTG GAC GTG 1344
433 IRERIDHQEWNPKVDV 448

1345 CAC CTG CGG TCG GAC GGC TAC ACC AAC TAC TCC CTC GAG ACA CTC GAC 1392
449 HLRSDGYTNYSLETLD 464

1393 GCT GGA AAG CGG CAG TGC AAG GCG GCC CTG CAG CGG GAC GTG GGC CTG 1440
465 AGKRQCKAALQRDVGL 480

1441 GAA GTG CGC GAC GAC GTG CCG CTG CTC GGC TTC ATC GGG CGT CTG GAT 1488
481 EVRDDVPLLGFIGRLD 496

1489 GGA CAG AAG GGC GTG GAC ATC ATC GGG GAC GCG ATG CCG TGG ATC GCG 1536
497 GQKGVDIIGDAMPWIA 512

1537 GGG CAG GAC GTG CAG CTG GTG ATG CTG GGC ACC GGC CCA CCT GAC CTG 1584
513 GQDVQLVMLGTGPPDL 528

1585 GAA CGA ATG CTG CAG CAC TTG GAG CGG GAG CAT CCC AAC AAG GTG CGC 1632
529 ERMLQHLEREHPNKVR 544
1633 GGG TGG GTC GGG TTC TCG GTC CTA ATG GTG CAT CGC ATC ACG CCG GGC 1680
545 GWVGFSVLMVHRITPG 560

1681 GCC AGC GTG CTG GTG ATG CCC TCC CGC TTC GCC GGC GGG CTG AAC CAG 1728
561 ASVLVMPSRFAGGLNQ 576

1729 CTC TAC GCG ATG GCA TAC GGC ACC GTC CCT GTG GTG CAC GCC GTG GGC 1776
577 LYAMAYGTVPVVHAVG 592

1777 GGG CTC AGG GAC ACC GTG GCG CCG TTC GAC CCG TTC GGC GAC GCC GGG 1824
593 GLRDTVAPFDPFGDAG 608

1825 CTC GGG TGG ACT TTT GAC CGC GCC GAG GCC AAC AAG CTG ATC GAG GTG 1872
609 LGWTFDRAEANKLIEV 624

1873 CTC AGC CAC TGC CTC GAC ACG TAC CGA AAC TAC GAG GAG AGC TGG AAG 1920
625 LSHCLDTYRNYEESWK 640

1921 AGT CTC CAG GCG CGC GGC ATG TCG CAG AAC CTC AGC TGG GAC CAC GCG 1968
641 SLQARGMSQNLSWDHA 656

1969 GCT GAG CTC TAC GAG GAC GTC CTT GTC AAG TAC CAG TGG 2007
657 AELYEDVLVKYQW 669
DNASIS ***** DNA TRANSLATION LIST ***** DATE 04-03-97

*** INPUT INFORMATION ***
FILE NAME : MSS1B.SEQ SEQUENCE : NORMAL 1749 BP
CODON TABLE : UNIV.TCN

SEQUENCE REGION : 1 - 1749

TRANSLATION REGION : 1 - 1749

*** DNA TRANSLATION ***



1 TGC GTC GCG GAG CTG AGC AGG GAG GGG CCC GCG CCG CGC CCG CTG CCA 48
1 CVAELSREGPAPRPLP 16

49 CCC GCG CTG CTG GCG CCC CCG CTC GTG CCC GGC TTC CTC GCG CCG CCG 96
17 PALLAPPLVPGFLAPP 32

97 GCC GAG CCC ACG GGT GAG CCG GCA TCG ACG CCG CCG CCC GTG CCC GAC 144
33 AEPTGEPASTPPPVPD 48

145 GCC GGC CTG GGG GAC CTC GGT CTC GAA CCT GAA GGG ATT GCT GAA GGT 192
49 AGLGDLGLEPEGIAEG 64

193 TCC ATC GAT AAC ACA GTA GTT GTG GCA AGT GAG CAA GAT TCT GAG ATT 240
65 SIDNTVVVASEQDSEI 80

241 GTG GTT GGA AAG GAG CAA GCT CGA GCT AAA GTA ACA CAA AGC ATT GTC 288
81 VVGKEQARAKVTQSIV 96

289 TTT GTA ACC GGC GAA GCT TCT CCT TAT GCA AAG TCT GGG GGT CTA GGA 336
97 FVTGEASPYAKSGGLG 112

337 GAT GTT TGT GGT TCA TTG CCA GTT GCT CTT GCT GCT CGT GGT CAC CGT 384
113 DVCGSLPVALAARGHR 128

385 GTG ATG GTT GTA ATG CCC AGA TAT TTA AAT GGT ACC TCC GAT AAG AAT 432
129 VMVVMPRYLNGTSDKN 144

433 TAT GCA AAT GCA TTT TAC ACA GAA AAA CAC ATT CGG ATT CCA TGC TTT 480
145 YANAFYTEKHIRIPCF 160

481 GGC GGT GAA CAT GAA GTT ACC TTC TTC CAT GAG TAT AGA GAT TCA GTT 528
161 GGEHEVTFFHEYRDSV 176

529 GAC TGG GTG TTT GTT GAT CAT CCC TCA TAT CAC AGA CCT GGA AAT TTA 576
177 DWVFVDHPSYHRPGNL 192

577 TAT GGA GAT AAG TTT GGT GCT TTT GGT GAT AAT CAG TTC AGA TAC ACA 624
193 YGDKFGAFGDNQFRYT 208


625 CTC CTT TGC TAT GCT ...
209 LLCYAACEAPLXLXLG 224



673 GGA TAT ATT TAT GGA CAG AAT TGC ATG TTT GTT GTC AAT GAT TGG CAT 720
225 GYIYGQNCMFVVNDWH 240

721 GCC AGT CTA GTG CCA GTC CTT CTT GCT GCA AAA TAT AGA CCA TAT GGT 768
241 ASLVPVLLAAKYRPYG 256

769 GTT TAT AAA GAC TCC CGC AGC ATT CTT GTA ATA CAT AAT TTA GCA CAT 816
257 VYKDSRSILVIHNLAH 272

817 CAG GGT GTA GAG CCT GCA AGC ACA TAT CCT GAC CTT GGG TTG CCA CCT 864
273 QGVEPASTYPDLGLPP 288

865 GAA TGG TAT GGA GCT CTG GAG TGG GTA TTC CCT GAA TGG GCG AGG AGG 912
289 EWYGALEWVFPEWARR 304

913 CAT GCC CTT GAC AAG GGT GAG GCA GTT AAT TTT TTG AAA GGT GCA GTT 960
305 HALDKGEAVNFLKGAV 320

961 GTG ACA GCA GAT CGA ATC GTG ACT GTC AGT AAG GGT TAT TCG TGG GAG 1008
321 VTADRIVTVSKGYSWE 336

1009 GTC ACA ACT GCT GAA GGT GGA CAG GGC CTC AAT GAG CTC TTA AGC TCC 1056
337 VTTAEGGQGLNELLSS 352

1057 AGA AAG AGT GTA TTA AAC GGA ATT GTA AAT GGA ATT GAC ATT AAT GAT 1104
353 RKSVLNGIVNGIDIND 368

1105 TGG AAC CCT GCC ACA GAC AAA TGT ATC CCC TGT CAT TAT TCT GTT GAT 1152
369 WNPATDKCIPCHYSVD 384

1153 GAC CTC TCT GGA AAG GCC AAA TGT AAA GGT GCA TTG CAG AAG GAG CTG 1200
385 DLSGKAKCKGALQKEL 400

1201 GGT TTA CCT ATA AGG CCT GAT GTT CCT CTG ATT GGC TTT ATT GGA AGG 1248
401 GLPIRPDVPLIGFIGR 416

1249 TTG GAT TAT CAG AAA GGC ATT GAT CTC ATT CAA CTT ATC ATA CCA GAT 1296
417 LDYQKGIDLIQLIIPD 432

1297 CTC ATG CGG GAA GAT GTT CAA TTT GTC ATG CTT GGA TCT GGT GAC CCA 1344
433 LMREDVQFVMLGSGDP 448

1345 GAG CTT GAA GAT TGG ATG AGA TCT ACA GAG TCG ATC TTC AAG GAT AAA 1392
449 ELEDWMRSTESIFKDK 464

1393 TTT CGT GGA TGG GTT GGA TTT AGT GTT CCA GTT TCC CAC CGA ATA ACT 1440
465 FRGWVGFSVPVSHRIT 480

1441 GCC GGC TGC GAT ATA TTG TTA ATG CCA TCC AGA TTC GAA CCT TGT GGT 1488
481 AGCDILLMPSRFEPCG 496

1489 CTC AAT CAG CTA TAT GCT ATG CAG TAT GGC ACA GTT CCT GTT GTC CAT 1536
497 LNQLYAMQYGTVPVVH 512

1537 GCA ACT GGG GGC CTT AGA GAT ACC GTG GAG AAC TTC AAC CCT TTC GGT 1584
513 ATGGLRDTVENFNPFG 528


52q 


GAG 

B 


(\f 


GGA 

Or 


GAG 




GGT 
Or 


T 


GHG 
Gr 


TG^ 

VvJ 


GGA 

A 


P 


art 

A 


TP 
P 


U 


arr 

T 


AC* 

*T 


1632 



1633 GAA AAC ATG TTT GTG GAC ATT GCG AAC TGC AAT ATC TAC ATA CAG GGA 1680
545 ENMFVDIANCNIYIQG 560

1681 ACA CAA GTC CTC CTG GGA AGG GCT AAT GAA GCG AGG CAT GTC AAA AGA 1728
561 TQVLLGRANEARHVKR 576

1729 CTT CAC GTG GGA CCA TGC CGC 1749
577 LHVGPCR 583
TRANSLATE of: mb73be2.seq check: 2027 from: 6 to: 2640
generated symbols 1 to: 878.

LOCUS MZEGLUCTRN 2640 bp ss-mRNA PLN May 20, 1993
DEFINITION Maize starch branching enzyme II mRNA, complete cds.
KEYWORDS 1,4-alpha-glucan branching enzyme; amylo-transglycosylase;
glucanotransferase; starch branching enzyme II.
SOURCE Zea mays (cultivar B73) 30 days post pollenation endosperm . . .

Mb73be2.pep Length: 799 Transit peptide 1-61 May 20, 1993 13:29 Type: P Check: 2844

1 MAFRVSGAVL GGAVRAPRLT GGGEGSLVFR HTGLFLTRGA RVGCSGTHGA 

51 MRAAAAARKA VMVPEGENDG LASRADSAQF QSDELEVPDI SEETTCGAGV 

101 ADAQALNRVR VVPPPSDGQK IFQIDPMLQG YKYHLEYRYS LYRRIRSDID

151 EHEGGLEAFS RSYEKFGFNR SAEGITYREW APGAFSAALV GDFNNWDPNA

201 DRMSKNEFGV WEIFLPNNAD GTSPIPHGSR VKVRMDTPSG IKDSIPAWIK

251 YSVQAPGEIP YDGIYYDPPE EVKYVFRHAQ PKRPKSLRIY ETHVGMSSPE

301 PKINTYVNFR DEVLPRIKKL GYNAVQIMAI QEHSYYGSFG YHVTNFFAPS

351 SRFGTPEELK SLIDRAHELG LLVLMDVVHS HASSNTLDGL NGFDGTDTHY

401 FHSGPRGHHW MWDSRLFNYG NWEVLRFLLS NARWWLEEYK FDGFRFDGVT

451 SMMYTHHGLQ VTFTGNFNEY FGFATDVDAV VYLMLVNDLI HGLYPEAVTI

501 GEDVSGMPTF ALPVHDGGVG FDYRMHMAVA DKWIDLLKQS DETWKMGDIV

551 HTLTNRRWLE KCVTYAESHD QALVGDKTIA FWLMDKDMYD FMALDRPSTP

601 TIDRGIALHK MIRLITMGLG GEGYLNFMGN EFGHPEWIDF PRGPQRLPSG

651 KFIPGNNNSY DKCRRRFDLG DADYLRYHGM QEFDQAMQHL EQKYEFMTSD

701 HQYISRKHEE DKVIVFEKGD LVFVFNFHCN NSYFDYRIGC RKPGVYKVVL

751 DSDAGLFGGF SRIHHAAEHF TADCSHDNRP YSFSVYTPSR TCVVYAPVE*

801 *RGTRCCAAC VGLSM*GKTF FQNRQMHACM LQ*GSDTLID AGKPMHLAAL 

851 SSLFI*DLQG VN*T*SFRFS LKKKKKKL 
LOCUS MZEGLUCTRN 2640 bp ss-mRNA PLN May 20, 1993
DEFINITION Maize starch branching enzyme II mRNA, complete cds.
KEYWORDS 1,4-alpha-glucan branching enzyme; amylo-transglycosylase;
         glucanotransferase; starch branching enzyme II.
SOURCE Zea mays (cultivar B73) 30 days post pollenation endosperm
       cDNA to mRNA.
ORGANISM Zea mays
STANDARD full automatic
FEATURES Location/Qualifiers
    sig_peptide 6..188
                /codon_start=3
    mat_peptide 189..2402
                /product="starch branching enzyme II"
                /codon_start=3
    CDS 6..2402
                /EC_number="2.4.1.18"
                /product="starch branching enzyme II"
                /codon_start=3
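
The feature coordinates in this record can be cross-checked against the peptide header, which gives a length of 799 with a transit peptide at residues 1-61. The following is a small Python sketch of that check, assuming inclusive GenBank-style coordinates and ignoring the /codon_start qualifier. The helper name `span_codons` is hypothetical.

```python
def span_codons(start, end):
    """Number of whole codons in an inclusive nucleotide span."""
    length = end - start + 1
    if length % 3:
        raise ValueError(f"span {start}..{end} is not a whole number of codons")
    return length // 3


# Spans copied from the FEATURES table of the record.
features = {
    "sig_peptide": (6, 188),
    "mat_peptide": (189, 2402),
    "CDS": (6, 2402),
}
codons = {name: span_codons(*span) for name, span in features.items()}
print(codons)  # {'sig_peptide': 61, 'mat_peptide': 738, 'CDS': 799}
```

Under these assumptions the signal-peptide span accounts for 61 codons and the full CDS for 799, matching the figures in the peptide header.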

mb73be2.seq Length: 2640 May 20, 1993 13:28 Type: N Check: 2027 

GCGAGATGGCGTTCCGGGTTTCTGGGGCGGTGCTCGGTGGGGCCGTAAGGGCTCCCCGACTCACCGGCGGCGGGG 
AGGGTAGTCTAGTCTTCCGGCACACCGGCCTCTTCTTAACTCGGGGTGCTCGAGTTGGATGTTCGGGGACGCACG 
GGGCCATGCGCGCGGCGGCCGCGGCCAGGAAGGCGGTCATGGTTCCTGAGGGCGAGAATGATGGCCTCGCATCAA 
GGGCTGACTCGGCTCAATTCCAGTCGGATGAACTGGAGGTACCAGACATTTCTGAAGAGACAACGTGCGGTGCTG 
GTGTGGCTGATGCTCAAGCCTTGAACAGAGTTCGAGTGGTCCCCCCACCAAGCGATGGACAAAAAATATTCCAGA 
TTGACCCCATGTTGCAAGGCTATAAGTACCATCTTGAGTATCGGTACAGCCTCTATAGAAGAATCCGTTCAGACA 
TTGATGAACATGAAGGAGGCTTGGAAGCCTTCTCCCGTAGTTATGAGAAGTTTGGATTTAATCGCAGCGCGGAAG 
GTATCACATATCGAGAATGGGCTCCTGGAGCATTTTCTGCAGCATTGGTGGGTGACTTCAACAACTGGGATCC^ 
ATGCAGATCGTATGAGCAAAAATGAGTTTGGTGTTTGGGAAATTTTTCT 

CTATTCCTCATGGATCTCGTGTAAAGGTGAGAATGGATACTCCATCAGGGATAAAGGATTCAATTCCAGCCTGGA 

TCAAGTACTCAGTGCAGGCCCCAGGAGAAATACCATATGATGGGATTTATTATGATCCTCCTGAAGAGGTAAAGT 

ATGTGTTCAGGCATGCGCAACCTAAACGACCAAAATCATTGCGGATATATGAAACACATGTCGGAATGAGTAGCC 

CGGAACCGAAGATAAACACATATGTAAACTTTAGGGATGAAGTCCTCCCAAGAATAAAAAAACTTGGATACAATG 

CAGTGCAAATAATGGCAATCCAAGAGCACTCATATTATGGAAGCTTTGGATACCATGTAACTAATTTTTCT 

CAAGTAGTCGTTTTGGTACCCCAGAAGAATTGAAGTCTTTGATTGATAGAGCACATGAGCTTGGTTTGCTAGTTC 

TCATGGATGTGGTTCATAGTCATGCGTCAAGTAATACTCTGGATGGGTTGAATGGTTTTGATGGTACAGATACAC 

A^MLCTTTCACAGTGGTCCACGTGGCCATCACTGGATGTGGGATTCTCGCCTATTTAACTATGGGAACTGGGAAG 

pTTAAGATTTCTTCTCTCCAATGCTAGATGGTGGCTCGAGGAATATAAGTTTGATTC 

TGACCTCCATGATGTACACTCATCACGGATTACAAGTAACATTTACGGGGAACTTCAATGAGTATTTTGGCTTTG 
C^CCGATGTAGATGCAGTGGTTTACTTGATGCTGGT^ 

CCATTGGTGAAGATGTTAGTGGAATGCCTACATTTGCCCTTCCTGTTCACGATGGTGGGGTAGGTTTTGACTATC 
^« GCATATGGCTGTG ^ 

TraGTGCACACACTGACAAATAGGAGGTGGTTAGAGAAGTGTGTAACTTATGCTGAAAGTCATGATCAAGCATTAG 
T^GGCGACAAGACTATTGCGTTTTGGTTGATGGACAAGGATATGTATGATTTCATGGCCCTCGATAGACCTTCAA 
S^^ ACCATOGATCGTGGGATAG< ^^ 

ATCTTAATTTCATGGGAAATGAGTTTGGACATCCTGAATGGATAGATTTTCCAAGAGGTCCGCAAAGACTTCC^ 
G £ G £^^ 

ATCTTAGGTATCATGGTATGCAAGAGTTTGATCATCCAA^^ 
G *55 rCACCAGTATA '^ 

TCTT<^CTTC(^CTGC^CAACAGCTATTTTGACTACCGTAOT^ 
•^^ GACTCCGACGCT ^ 

GTTCGCATGATAATAGGCCATATTCATTCTCGGTTTATACACCAAGCAGAACATGTGTCGTCTATGCTCCAGTGG 

AG * GATAGCGGGGTACTCGTOGCT ^ 

G ?!; AGATG ^ TG ^ TC ^ TGCTA ^ 
CGTTGTCCTCTCTATTTATAT^ 

AAAAAAAAAAACTCA 
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Maize Branching Enzyme I Mature Protein Sequence 



1 ATVQEDKTMA TAKGDVDHLP IYDLDPKLEI FKDHFRYRMK RFLEQKGSIE

51 ENEGSLESFS KGYLKFGINT NEDGTVYREW APAAQEAELI GDFNDWNGAN

101 HKMEKDKFGV WSIKIDHVKG KPAIPHNSKV KFRFLHGGVW VDRIPALIRY

151 ATVDASKFGA PYDGVHWDPP ASERYTFKHP RPSKPAAPRI YEAHVGMSGE

201 KPAVSTYREF ADNVLPRIRA NNYNTVQLMA VMEHSYYASF GYHVTNFFAV

251 SSRSGTPEDL KYLVDKAHSL GLRVLMDVVH SHASNNVTDG LNGYDVGQST

301 QESYFHAGDR GYHKLWDSRL FNYANWEVLR FLLSNLRYWL DEFMFDGFRF

351 DGVTSMLYHH HGINVGFTGN YQEYFSLDTA VDAVVYMMLA NHLMHKLLPE

401 ATVVAEDVSG MPVLCRPVDE GGVGFDYRLA MAIPDRWIDY LKNKDDSEWS

451 MGEIAHTLTN RRYTEKCIAY AESHDQSIVG DKTIAFLLMD KEMYTGMSDL

501 QPASPTIDRG IALQKMIHFI TMALGGDGYL NFMGNEFGHP EWIDFPREGN

551 NWSYDKCRRQ WSLVDTDHLR YKYMNAFDQA MNALDERFSF LSSSKQIVSD

601 MNDEEKVIVF ERGDLVFVFN FHPKKTYEGY KVGCDLPGKY RVALDSDALV

651 FGGHGRVGHD VDHFTSPEGV PGVPETNFNN RPNSFKVLSP PRTCVAYYRV

701 DEAGAGRRLH AKAETGKTSP AESIDVKASR ASSKEDKEAT AGGKKGWKFA

751 RQPSDQDTK



ID D11081 unannotated; RNA; UNA; 2763 BP.
XX
AC D11081;
XX
DT 13-JUL-1992 (Rel. 32, Created)
DT 13-JUL-1992 (Rel. 32, Last updated, Version 1)
DE Zea mays L. maize branching enzyme-I cDNA to mRNA.
FH Key Location/Qualifiers
FH
XX
SQ Sequence 2763 BP; 719 A; 585 C; 737 G; 722 T; 0 other;

D11081 Length: 2763 February 4, 1993 11:16 Type: N Check: 6169



1 GCTGTGCCTC GTGTCGCCCT CTTCCTCGCC GACTCCGCTT CCGCCGCCGC

51 GGCGCTCTCG CTCGCATGCT GATCGGGCGG CACCGCCGGG GATCGCGGGT

101 GGCGGCAATG TGCGCCTGAG TGTGTTGTCT GTCCAGTGCA AGGCTCGCCG

151 GTCAGGGGTG CGGAAGGTCA AGAGCAAATT CGCCACTGCA GCTACTGTGC

201 AAGAAGATAA AACTATGGCA ACTGCCAAAG GCGATGTCGA CCATCTCCCC

251 ATATACGACC TGGACCCCAA GCTGGAGATA TTCAAGGACC ATTTCAGGTA

301 CCGGATGAAA AGATTCCTAG AGCAGAAAGG ATCAATTGAA GAAAATGAGG

351 GAAGTCTTGA ATCTTTTTCT AAAGGCTATT TGAAATTTGG GATTAATACA

401 AATGAGGATG GAACTGTATA TCGTGAATGG GCACCTGCTG CGCAGGAGGC

451 AGAGCTTATT GGTGACTTCA ATGACTGGAA TGGTGCAAAC CATAAGATGG

501 AGAAGGATAA ATTTGGTGTT TGGTCGATCA AAATTGACCA TGTCAAAGGG

551 AAACCTGCCA TCCCTCACAA TTCCAAGGTT AAATTTCGCT TTCTACATGG

601 TGGAGTATGG GTTGATCGTA TTCCAGCATT GATTCGTTAT GCGACTGTTG

651 ATGCCTCTAA ATTTGGAGCT CCCTATGATG GTGTTCATTG GGATCCTCCT

701 GCTTCTGAAA GGTACACATT TAAGCATCCT CGGCCTTCAA AGCCTGCTGC

751 TCCACGTATC TATGAAGCCC ATGTAGGTAT GAGTGGTGAA AAGCCAGCAG

801 TAAGCACATA TAGGGAATTT GCAGACAATG TGTTGCCACG CATACGAGCA

851 AATAACTACA ACACAGTTCA GTTGATGGCA GTTATGGAGC ATTCGTACTA

901 TGCTTCTTTC GGGTACCATG TGACAAATTT CTTTGCGGTT AGCAGCAGAT

951 CAGGCACACC AGAGGACCTC AAATATCTTG TTGATAAGGC ACACAGTTTG

1001 GGTTTGCGAG TTCTGATGGA TGTTGTCCAT AGCCATGCAA GTAATAATGT
1101 ATTTTCATGC GGGAGATAGA UGXTAT CAT A AACTTTGGGA I'AiiXC^Ub

1151 TTCAACTATG CTAACTGGGA GGTATTAAGG TTTCTTCTTT CTAACCTGAG

1201 ATATTGGTTG GATGAATTCA TGTTTGATGG CTTCCGATTT GATGGAGTTA

1251 CATCAATGCT GTATCATCAC CATGGTATCA ATGTGGGGTT TACTGGAAAC

1301 TACCAGGAAT ATTTCAGTTT GGACACAGCT GTGGATGCAG TTGTTTACAT

1351 GATGCTTGCA AACCATTTAA TGCACAAACT CTTGCCAGAA GCAACTGTTG

1401 TTGCTGAAGA TGTTTCAGGC ATGCCGGTCC TTTGCCGGCC AGTTGATGAA

1451 GGTGGGGTTG GGTTTGACTA TCGCCTGGCA ATGGCTATCC CTGATAGATG

1501 GATTGACTAC CTGAAGAATA AAGATGACTC TGAGTGGTCG ATGGGTGAAA

1551 TAGCGCATAC TTTGACTAAC AGGAGATATA CTGAAAAATG CATCGCATAT

1601 GCTGAGAGCC ATGATCAGTC TATTGTTGGC GACAAAACTA TTGCATTTCT

1651 CCTGATGGAC AAGGAAATGT ACACTGGCAT GTCAGACTTG CAGCCTGCTT

1701 CACCTACAAT TGATCGAGGG ATTGCACTCC AAAAGATGAT TCACTTCATC

1751 ACAATGGCCC TTGGAGGTGA TGGCTACTTG AATTTTATGG GAAATGAGTT

1801 TGGTCACCCA GAATGGATTG ACTTTCCAAG AGAAGGGAAC AACTGGAGCT

1851 ATGATAAATG CAGACGACAG TGGAGCCTTG TGGACACTGA TCACTTGCGG

1901 TACAAGTACA TGAATGCGTT TGACCAAGCG ATGAATGCGC TCGATGAGAG

1951 ATTTTCCTTC CTTTCGTCGT CAAAGCAGAT CGTCAGCGAC ATGAACGATG

2001 AGGAAAAGGT TATTGTCTTT GAACGTGGAG ATTTAGTTTT TGTTTTCAAT

2051 TTCCATCCCA AGAAAACTTA CGAGGGCTAC AAAGTGGGAT GCGATTTGCC

2101 TGGGAAATAC AGAGTAGCCC TGGACTCTGA TGCTCTGGTC TTCGGTGGAC

2151 ATGGAAGAGT TGGCCACGAC GTGGATCACT TCACGTCGCC TGAAGGGGTG

2201 CCAGGGGTGC CCGAAACGAA CTTCAACAAC CGGCCGAACT CGTTCAAAGT

2251 CCTTTCTCCG CCCCGCACCT GTGTGGCTTA TTACCGTGTA GACGAAGCAG

2301 GGGCTGGACG ACGTCTTCAC GCGAAAGCAG AGACAGGAAA GACGTCTCCA

2351 GCAGAGAGCA TCGACGTCAA AGCTTCCAGA GCTAGTAGCA AAGAAGACAA

2401 GGAGGCAACG GCTGGTGGCA AGAAGGGATG GAAGTTTGCG CGGCAGCCAT

2451 CCGATCAAGA TACCAAATGA AGCCACGAGT CCTTGGTGAG GACTGGACTG

2501 GCTGCCGGCG CCCTGTTAGT AGTCCTGCTC TACTGGACTA GCCGCCGCTG

2551 GCGCCCTTGG AACGGTCCTT TCCTGTAGCT TGCAGGCGAC TGGTGTCTCA
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2601 TCACCGAGCA GGCAGGCACT GCTTGTATAG CTTTTCTAGA ATAATAATCA 

2651 GGGATGGATG GATGGTGTGT ATTGGCTATC TGGCTAGACG TGCATGTGCC 

2701 CAGTTTGTAT GTACAGGAGC AGTTCCCGTC CAGAATAAAA AAAAACTTGT 

2751 TGGGGGGTTT TTC 



DNASIS ***** DNA TRANSLATION LIST ***** DATE 04-03-97

*** INPUT INFORMATION *** 

FILE NAME : MSTS1TP.SEQ SEQUENCE : NORMAL 153 BP 
CODON TABLE : UNIV.TCN 

SEQUENCE REGION : 1 - 153 

TRANSLATION REGION : 1 - 153 

*** DNA TRANSLATION *** 

1 ATG GCG ACG CCC TCG GCC GTG GGC GCC GCG TGC CTC CTC CTC GCG CGG 48
1 MATPSAVGAACLLLAR 16

49 GCC GCC TGG CCG GCC GCC GTC GGC GAC CGG GCG CGC CCG CGG CGG CTC 96
17 AAWPAAVGDRARPRRL 32

97 CAG CGC GTG CTG CGC CGC CGG TGC GTC GCG GAG CTG AGC AGG GAG GGG 144
33 QRVLRRRCVAELSREG 48

145 CCC CAT ATG 153
49 PHM 51
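
The translation above can be reproduced with a few lines of Python, which is a convenient way to check residues against codons where the reproduction is unclear. This is a sketch only: it assumes that the codon table named in the header (UNIV.TCN) corresponds to the standard genetic code, and the names `CODON_TABLE` and `translate` are illustrative rather than part of DNASIS.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate a DNA string in frame 1, ignoring spaces and any
    trailing partial codon. Stop codons are written as '*'."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# The 153 bp MSTS1TP sequence, copied row by row from the listing.
MSTS1TP = (
    "ATG GCG ACG CCC TCG GCC GTG GGC GCC GCG TGC CTC CTC CTC GCG CGG "
    "GCC GCC TGG CCG GCC GCC GTC GGC GAC CGG GCG CGC CCG CGG CGG CTC "
    "CAG CGC GTG CTG CGC CGC CGG TGC GTC GCG GAG CTG AGC AGG GAG GGG "
    "CCC CAT ATG"
)

print(translate(MSTS1TP))
# MATPSAVGAACLLLARAAWPAAVGDRARPRRLQRVLRRRCVAELSREGPHM
```

The same function can be applied to any row of the longer listings to confirm a residue whose printed letter is doubtful.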
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22

1. Marker.
2. PCR of WT DNA for glgC3;
3. PCR of WT DNA for SSI-2;
4. PCR of WT DNA for SSIIa-2;
5. PCR of C3-6 DNA for glgC3;
6. PCR of C3-9 DNA for glgC3;
7. PCR of C3-29 DNA for glgC3;
8. PCR of C3-35 DNA for glgC3;
9. PCR of C3-Ia-2 DNA for glgC3;
10. PCR of C3-Ia-2 DNA for SSI-2;
11. PCR of C3-Ia-3 DNA for glgC3;
12. PCR of C3-Ia-3 DNA for SSI-2;
13. PCR of C3-Ia-1 DNA for glgC3;
14. PCR of C3-Ia-1 DNA for SSI-2;
15. PCR of C3-Ia-5 DNA for glgC3;
16. PCR of C3-Ia-5 DNA for SSI-2;
17. PCR of C3-IIa-2 DNA for glgC3;
18. PCR of C3-IIa-2 DNA for SSIIa-2;
19. PCR of C3-IIa-5 DNA for glgC3;
20. PCR of C3-IIa-5 DNA for SSIIa-2;
21. PCR of C3-IIa-12 DNA for glgC3;
22. PCR of C3-IIa-12 DNA for SSIIa-2.
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transformed plant cells and plants, plant-functional regulatory sequences, and processes for plant cell transformation and 
whole plant regeneration; these features are not required by any other group. 

The invention of Group III, drawn to a third product, shares the technical feature of requiring mutant or gene- or 
enzyme- inactivated host cells, and methods for such mutation or gene- or enzyme- inactivation; these features are not 
required by any other group. 

The invention of Group IV, drawn to a fourth product, shares the technical feature of being isolated polysaccharides; this
feature is not required by any other group.
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